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PRELIMINARY AMENDMENT 

Assistant Commissioner for Patents 
Washington, D.C. 20231 

Sir: 

Prior to examination of the present Application, Applicants respectfully request 
that the above-identified application be amended as follows: 

In the Claims: 

Please amend the following claims: 

Claim 5, line 1, delete "any one of claims 2 to 4" and insert 
-Claim 2-. 

Claim 7, line 1, delete "any one of claims 2 to 6" and insert 
-Claim 2-. 

Please add the following new claims: 

10. The gene according to claim 3, wherein the gene is derived from a 
microorganism belonging to the genus Mucor. 

1 1 . The gene according to claim 4, wherein the gene is derived from a 
microorganism belonging to the genus Mucor. 

1 2. A recombinant vector which comprises the gene according to claim 3. 
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1 3. A recombinant vector which comprises the gene according to claim 4. 



14. A recombinant vector which comprises the gene according to claim 5. 



1 5. A recombinant vector which comprises the gene according to claim 6. 
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DESCRIPTION 

Endo- 0 -N-Acetylglucosaminidase Gene 

Technical Field 

The present invention relates to a novel endo- j8 -N-acetylglucosaminidase gene, and 
specifically relates to a gene where said gene is derived from the genus Mucor. Further, the 
present invention relates to a recombinant plasmid which comprises said gene, an organism 
transformed with said plasmid, and a method of producing a novel endo- 0 -N- 
acetylglucosaminidase using said transformant. 

Background Art 

Glycoproteins are widely found in animal tissue, plant tissue, and the cell membrane and cell 
wall etc. of eukaryotic microorganisms. 

In recent years, it has become increasingly clear that the sugar chains of glycoproteins play an 
important role in mechanisms such as cell differentiation, carcinogenesis and intercellular 
recognition. In order to clarify these mechanisms, research into the correlation between sugar 
chain structure and its function is proceeding. As a means of achieving this objective, when 
cleaving a sugar chain from a glycoprotein or when identifying the structure of a sugar chain, 
various glycosidases are employed. Among these, endo- 0 -N-acetylglucosaminidase acts on 
the asparagine-linked sugar chain (N-linked sugar chain, N-sugar chain) and has the action of 
cleaving the diacetyl-chitobiose portion that exists within the sugar chain thereby liberating 
the sugar chain. 

Since endo- 0 -N-acetylglucosaminidase can liberate the sugar portion of a glycoprotein from 
the protein portion, it is thought to be important in the analysis of the function and structure of 
sugar chains in glycoproteins. 
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Asparagine-linked sugar chains may be classified by their structure as high mannose type 
(mannane type sugar chain), hybrid type, or complex type. 

Known endo- # -N-acetylglucosaminidases include Endo H (A. L. Tarentino and F. Maley, /. 
Biol. Chem., 249, 811 (1974)), Endo F (K. Takegawa, et al., Eur. j. Biochem., 202, 175 
(1991)), and EndoA (K. Takegawa, et al., Appl. Environ. Microbiol., 55, 3107 (1989)). 
However, these enzymes only act upon sugars with specific structures, and do not act upon 
glycoproteins except in the presence of a denaturing agent. 

Endo- j3 -N-acetylglucosaminidase derived from Mucor hiemalis is capable of cleaving tri- 
antennary complex type sugar chains in respect of not only high mannose type (mannane type 
sugar chain) and hybrid type, but also complex type chains. Further, with the asialylated type, 
cleavage ability extends to tetra-antennary heteroglycan chains. Further, it is known that it is 
possible to free sugar chains from glycoproteins without subjecting the protein to 
denaturization treatment. (S. Kadowaki, et al., Agric. Biol. Chem., 54, 97 (1990)). Therefore, 
endo- /3 -N-acetylglucosaminidase derived from Mucor hiemalis can be said to be useful in 
the research of the functional and physiological role of sugar chains and proteins of 
glycoproteins. 

On the other hand, conversion of mannane type sugar chains derived from yeast into sugar 
chains in a form compatible with humans, is extremely significant industrially. As methods 
for this conversion, in vivo conversion through improvement of the yeast sugar chain 
biosynthetic system by genetic manipulation, and in vitro conversion using the trans- 
glycosylation reaction can be considered. For the purpose of sugar conversion, endo- jS -N- 
acetylglucosaminidase is required to have as properties, 1) the substrate specificity, i.e an 
ability to cleave both mannane and complex types; and, 2) the ability to perform the trans- 
glycosylation reaction, which is the reverse reaction of the decomposition reaction. Therefore, 
endo- /? -N-acetylglucosaminidase derived from Mucor hiemalis can be said to be an 
appropriate enzyme for the practice of said conversion. 
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The present inventors, have proposed a sugar chain conversion technique using endo- /S -N- 
acetylglucosaminidase derived from Mucor hiemalis which can alter yeast sugar chains to 
human-compatible form. (Japanese Patent Application Laid-Open No: Hei 7-59587) 

To perform sugar chain conversions such as the above, an authentic enzyme preparation of 
high purity is required in great amounts. In this case, improvement of enzyme productivity by 
conventional breeding methods using mold cells has been considered. However, since 
conventional breeding methods are limited predominantly to the method of selecting such an 
enzyme from mutant strains obtained using ultraviolet light or mutagens, the isolation of 
stable mutants is difficult. Further, conventional breeding methods are often accompanied by 
unfavorable transformations. Further, since molds generally produce protease enzymes, they 
are unfavorable for production of an enzyme for the purpose of sugar conversion. Since, in 
order to overcome these problems it is necessary to proceed through a number of purification 
steps, the work is complicated and the yield is low. For example, culturing a microorganism 
belonging to the genus Mucor which is a type of filamentous mold, even if purification of the 
supernatant of this culture is performed, contamination with protease cannot be prevented and 
preparation in large amounts is difficult because of low enzyme productivity of the 
microorganism, and thus this method was of little practical value. 

Given the above, it is desired for the purpose of mass producing endo- 0 -N- 
acetylglucosaminidase, that the gene for said enzyme be obtained and produced through the 
use of genetic engineering. Further, if the gene can be obtained, it can be expected that an 
enzyme with improved heat resistance and pH resistance and increased reaction rate can be 
obtained using protein engineering techniques. However, there have been no reported 
attempts at gene cloning to date. 

Disclosure of Invention 

It is an object of the present invention to provide an endo- /3 -N-acetylglucosaminidase, an 
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endo- /S -N-acetylglucosaminidase gene, a recombinant vector which comprises said gene, an 
organism transformed with said vector, and a method of producing endo- /3 -N- 
acetylglucosaminidase. 

The present inventors, as a result of their intensive research directed to solving the above 
problem, based on partial amino acid sequence information of said endo- /3 -N- 
acetylglucosaminidase derived from Mucor hiemalis, have succeeded in obtaining the gene 
that encodes said enzyme from a cDNA library prepared from Mucor hiemalis which is a 
bacteria which produces said enzyme, have further succeeded in expressing this gene in yeast, 
and thereby completed the present invention. 

Thus, the present invention provides the recombinant protein of (a) or (b) below: 

(a) a protein comprising the amino acid sequence represented by SEQ ID NO: 3; and, 

(b) a protein comprising an amino acid sequence derived from the amino acid 
sequence represented by SEQ ID NO: 3 by deletion, substitution, insertion, or 
addition of at least one amino acid and having the activity of endo- £ -N- 
acetylglucosaminidase. 

Further, the present invention provides a endo- # -N-acetylglucosaminidase gene which 
encodes the protein of (a) or (b) below, and a gene that hybridizes with said gene under 
stringent conditions, and which comprises DNA encoding a protein that has endo- /S -N- 
acetylglucosaminidase activity. 

(a) a protein comprising the amino acid sequence represented by SEQ ID NO: 3; 
and, 

(b) a protein comprising an amino acid sequence derived from the amino acid 
sequence represented by SEQ ID NO: 3 by deletion, substitution, insertion, or 
addition of at least one amino acid and having the activity of endo- /3 -N- 
acetylglucosaminidase. 

Further, the present invention provides a gene comprising the DNA of (c) or (d) below: 
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(c) a DNA consisting of the nucleotide sequence represented by SEQ ID NO: 2; and, 

(d) a DNA which hybridizes under stringent conditions with a DNA consisting of the 
nucleotide sequence represented by SEQ ID NO: 2, and encodes a protein having 
the activity of endo- /3 -N-acetylglucosaminidase. 

Said gene includes a gene derived from a microorganism belonging to the genus Mucor. 

Further, the present invention provides a recombinant vector which comprises said gene. 

Further, the present invention provides a transformant which comprises said recombinant 
vector. 

Further, the present invention provides a method for producing an endo- -N- 
acetylglucosaminidase, comprising culturing said transformant, and collecting endo- # -N- 
acetylglucosaminidase from the obtained culture product. 

Below, the present invention will be described in detail. 

The present invention is characterized by culturing endo- /3 -N-acetylglucosaminidase 
producing microorganisms, purifying the endo- P -N-acetylglucosaminidase from the obtained 
culture, designing degenerate probes from a partial amino acid sequence of said enzyme, 
cloning a gene encoding said enzyme by performing PCR, and further, cloning a gene 
encoding said enzyme from a cDNA library of the microorganism producing endo- /3 -N- 
acetylglucosaminidase. Further, the present invention is characterized by obtaining a 
recombinant vector by introducing the cloned gene into a vector, as well as by obtaining a 
transformant by introducing said recombinant vector into a host cell. Further, the present 
invention is characterized by producing endo- /? -N-acetylglucosaminidase in large quantities 
by culturing said transformant. 

1. Culturing of endo- 0 -N-acetylglucosaminidase producing microorganisms 
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Microorganisms for producing endo- jS -N-acetylglucosaminidase include microorganisms 
belonging to the genus Mucor, preferably Mucor hiemalis, and more preferably the Mucor 
hiemalis deposited at the National Institute of Bioscience and Human-Technology, Agency of 
Industrial Science and Technology (1-1-3, Higashi, Tsukuba-shi, Ibaraki-ken, Japan) 
(Accession No. FERM BP-4991). 

The medium composition used in culturing of these strains may be of any kind typically used 
in the culture of microorganisms. 

Carbon sources include for example sugars such as glucose, sucrose, mannose, galactose, 
maltose, soluble starch and dextrin. Nitrogen sources include yeast extract, trypton, etc. As 
inorganic salts, apart from the inorganic salts contained in the above nitrogen sources, salts 
such as all varieties of sodium salts, potassium salts, calcium salts, magnesium salts and 
phosphate salts may be used. Vitamins may be added optionally. 

The culture medium is sterilized by a conventional method, and the strain is inoculated into 
the medium. Thereafter, a shake culture, or aeration-agitation culture is performed at 20-30°C, 
pH 5-7 for 2 to 4 days. 

In the present invention, culture is preferably performed at 25-30°C, pH 6, with galactose as 
the carbon source, yeast extract and trypton as the nitrogen sources, with concentrations of 
both carbon and nitrogen sources at 2-3% each, and the ratio of carbon source to nitrogen 
source at 2:3, for 3 to 4 days under good aeration conditions. Where the culture is performed 
under such conditions, the amount of enzyme produced is maximized, and in comparison to 
the known method (S. Kadowaki, et al., Agric. Biol. Chem., 54, 97 (1990); glucose 0.5%, 
yeast extract 1%, peptone 1 %), about 10-fold greater productivity can be achieved. 



Further, in the present invention, to ensure the aeration conditions when culturing the 
microorganisms, use of a jar fermenter is preferable. 



2. Purification of endo- ]3 -N-acetylglucosaminidase 

The endo- /3 -N-acetylglucosaminidase produced by the above bacterial strain is characterized 
by preservation of the following activity. In other words, it can be characterized by its activity 
to act on the asparagine-linked sugar chain in a glycoprotein, cleave the diacetyl-chitobiose 
portion within the sugar chain, and thereby liberate the sugar chain. 

Purification of endo- /? -N-acetylglucosaminidase can be performed by the appropriate 
combination of known methods for separation and purification. Examples of such methods 
include methods using differences in solubility such as salt precipitation and solvent 
precipitation; methods using differences in molecular weight such as dialysis, ultrafiltration, 
gel filtration and SDS-polyacrylamide electrophoresis; methods using differences in 
electrostatic charge such as ion exchange chromatography; methods using differences in 
hydrophobicity such as hydrophobic chromatography and reverse phase chromatography, and 
methods using differences in isoelectric point such as isoelectric focusing. 

In the present invention, by adopting a method of culturing which is an improvement over the 
above described known method (S. Kadowaki, et al., Agric Biol. Chem., 54, 97 (1990)), and 
subjecting to many purification steps, endo- /S -N-acetylglucosaminidase can be efficiently 
purified, and it is possible to obtain a sufficient amount of the protein to determine the amino 
acid sequence necessary to obtain the gene. The obtained enzyme, as a result of purification 
and as a result of the analysis of the gene to be described below, consists of a single gene 
product with a molecular weight of approximately 85,000, and after post-translational partial 
digestion of the gene product, it was found to be composed of 2 or more subunits including 
peptides with molecular weights of approximately 60,000 and 14,000. 

3. Cloning of the novel endo- jS -N-acetylglucosaminidase gene 

It was clear that the endo- 0 -N-acetylglucosaminidase obtained from Mucor hiemalis was 
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composed of at least 2 or more peptides. 



Generally, when isolating a gene encoding a specific protein, a partial amino acid sequence of 
the protein is determined, and it is possible to isolate the desired gene from a gene library with 
a mixture of oligonucleotides consisting of degenerative codons as a probe. Further, after 
obtaining a fragment by PCR such as in the present invention, it is possible to isolate the 
desired gene from a gene library using this fragment as a probe. 

However, since endo- /3 -N-acetylglucosaminidase is a hetero-oligomeric molecule consisting 
of 2 or more subunits, there is the possibility that each subunit is encoded independently by 
their separate differing genes. Further, even if endo- jS -N-acetylglucosaminidase is derived 
from one gene, its structure, for example, the positional relationship between the regions 
encoding the two subunits within the structural gene was unclear. 

Thus, the present inventors determined partial amino acid sequences for 2 subunits, and then 
after obtaining partial fragments by PCR, used said fragments as probes to achieve cDNA 
cloning and by analyzing the gene structure, clarified that the same gene coded for these 2 
subunits. That is, it was clarified that the novel endo- /3 -N-acetylglucosaminidase is produced 
as one polypeptide from the gene encoding this enzyme, and is processed by partial 
decomposition into 2 or more subunits. 

The gene of the present invention is cloned by, for example, the following method. 
(1) Cloning of the endo- 0 -N-acetylglucosaminidase gene 

In the present invention, an example of a DNA fragment comprising a gene encoding a novel 
endo- jS -N-acetylglucosaminidase is the DNA fragment indicated by the restriction enzyme 
map shown in Fig. 2. This fragment can be isolated using genetic engineering methods from a 
cDNA library with an mRNA template prepared from a microorganism belonging to the 
genus Mucor, preferably from a strain of Mucor hiemalis, and more preferably from the strain 
of Mucor hiemalis deposited under accession number FERM BP-4991 at the National 
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Institute of Bioscience and Human-Technology, (See the method described in, for example, 
Molecular Cloning: A Laboratory Manual (Sambrook, Maniatis et al, Cold Spring Harbour 
Laboratory Press (1989)) 

Preparation of mRNA can be performed according to a typical method. For example, after 
culturing the mRNA source, Mucor hiemalis, total RNA is obtained from the cultured cell 
with a kit available on the market (ISOGEN (Nippon Gene Company)), and then purified with 
a purification kit available on the market (mRNA Purification Kit (Pharmacia Biotech)). In 
the preparation of mRNA, it is preferable to keep the culturing time short to control the 
decomposition of mRNA. 

With the thus obtained mRNA as a template, a single-strand cDNA is synthesized using an 
oligo dT primer and a reverse transcriptase enzyme. Thereupon, a duplex cDNA is 
synthesized from said single stranded cDNA. A recombinant vector is then constructed by 
incorporating the duplex cDNA into a suitable cloning vector. A cDNA library can be 
obtained by transforming E.coli using the obtained recombinant vector, and selecting 
transformants using tetracycline resistance and ampicillin resistance as an indicator. 

Here, transformation of E.coli is performed according to a method such as the Hanahan 
method [Hanahan, D. : J. Mol. Biol. 166 : 557-580 (1983)]. When a plasmid is to be used as a 
vector, it is necessary to include a gene for resistance to a drug such as tetracycline or 
ampicillin. Further, a cloning vector other than a plasmid, for example, X phage or the like 
can be used. 

A strain having the desired DNA is selected (screened) from the thus obtained transformant. 
Screening methods include, for example, a method of synthesizing sense and antisense 
primers corresponding to a partial amino acid sequence of endo- jS -N-acetylglucosaminidase, 
and using this to perform a polymerase chain reaction (PCR). Template DNA may include for 
example, genomic DNA or cDNA synthesized by reverse transcription from the above- 
mentioned mRNA. As primers, in respect of the sense chain, for example, 5'- 
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C ARTTRC ARCCNGAYGAYAA-3 ' (SEQ ID NO: 5) synthesized on the basis of amino acid 
sequence: PSLQLQPDDK (SEQ ID NO: 4) and 5'-CCHACNGAYCARAAYATYAA-3' 
(SEQ ID NO: 7) synthesized on the basis of amino acid sequence: SYRNPEIYPTDQNIK 
(SEQ ID NO: 6), may be used. Further, in respect of the antisense chain 3'- 
GGDTGNCTRGT YTTRTARTT-5 ' (SEQ ID NO: 8) synthesized on the basis of amino acid 
sequence: SYRNPEIYPTDQNIK (SEQ ID NO: 6) and 3 ' -TT YCCDGT YGCD A ARTTRGT- 
5' (SEQ ID NO: 10) synthesized on the basis of amino acid sequence: 
GQRFNHRESHDVETEI (SEQ ID NO: 9), may be used. However, the present invention is 
not limited to these primers. 

Thus, the obtained DNA amplification fragment is labeled with for example 32 P, 35 S or biotin 
and taken as a probe, and is then made to hybridize with the cDNA library of the transformant, 
which library has been denatured and immobilized onto a nitrocellulose filter. Screening can 
then be performed by searching the obtained positive strains. 

(2) Determination of the nucleotide sequence 

Determination of the nucleotide sequence of the obtained clone is performed. Determination 
of the nucleotide sequence may be performed by known methods such as the Maxam-Gilbert 
chemical modification method or dideoxy method, though typically determination of the 
sequence is performed using an automated nucleotide sequencer. (For example, PERKTN- 
ELMER 377A DNA Sequencer) 

The full length sequence of the endo- j3 -N-acetylglucosaminidase gene is indicated by SEQ 
ID NO: 1. Therein, a preferable example of the gene of the present invention, is the nucleotide 
sequence from position 71 to position 2305 (SEQ ID NO: 2) of the nucleotide sequence 
indicated by SEQ ID NO: 1. Further, the gene of the present invention includes not only a 
sequence which encodes the amino acid sequence represented by SEQ ID NO: 3 or the below 
described amino acid sequence having an equivalent sequence, but also encompasses 
degenerate isomers encoding identical polypeptides which differ in respect of degenerate 
codons only. 
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A nucleotide sequence encoding an amino acid sequence having an equivalent sequence can 
be prepared using a method such as site-directed mutagenesis. That is, mutations may be 
introduced by known methods such as the Kunkel method or Gapped duplex method, or other 
methods equivalent thereto, using for example a mutation introduction kit that employs site- 
directed mutagenesis (e.g. Mutant-K (Takara), Mutant-G (Takara)) etc., or using Takara's LA 
PCR in vitro Mutagenesis series kit. 

Further, an endo- 0 -N-acetylglucosaminidase gene includes not only DNA consisting of the 
nucleotide sequence represented by SEQ ID NO: 1 or SEQ ID NO: 2, but also a DNA which 
hybridizes under stringent conditions with said DNA, and encodes a protein having the 
activity of endo-/3 -N-acetylglucosaminidase. Stringent conditions refer to, for example, the 
conditions of a sodium concentration of 50-300mM, preferably 150mM and a temperature of 
50-68°C, preferably 65°C. 

Once the nucleotide sequence of the endo- jS -N-acetylglucosaminidase gene (SEQ ID NO: 1) 
is identified, since the nucleotide sequence of the DNA fragment having the sequence from 
position 71 to 2305 of the said nucleotide sequence (open reading frame) is determined (SEQ 
ID NO: 2), it is possible to obtain the endo- 0 -N-acetylglucosaminidase gene by chemical 
synthesis; by PCR with genomic DNA as a template and the 5' and 3' terminal sequences of 
the open reading frame (SEQ ID NO: 2) (e.g. 5 '-ATGCCTTCACTTCAATTGC AACC-3 ' 
(SEQ ID NO: 11) and 5 '-CTAGTTTAATGAC AAATCTATGC-3 ' (SEQ ID NO: 12) as 
primers; or, by hybridization with a DNA fragment having the nucleotide sequence of an 
endo- £ -N-acetylglucosaminidase gene as a probe. 

The plasmid pZL-Endo (See Example 3 below) which comprises the gene of the present 
invention was introduced into E.coli DH10B (Title: DHBpZL-Endo) and deposited at 
National Institute of Bioscience and Human-Technology (1-1-3, Higashi, Tsukuba-shi, 
Ibaraki-ken, Japan) on April 28, 1998 under accession number FERM BP-6335. 
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In the present invention, the amino acid sequence represented by SEQ ID NO: 3 or a 
polypeptide comprising an equivalent sequence, is provided as a preferable example of a 
recombinant novel endo- 0 -N-acetylglucosaminidase. Here, "equivalent sequence" refers to a 
sequence which comprises the amino acid sequence represented by SEQ ID NO: 3 and in 
which at least one amino acid is inserted, substituted, deleted or added to either end, and 
which retains said novel endo- 0 -N-acetylglucosaminidase activity. Retention of novel endo- 
iS -N-acetylglucosaminidase activity in this equivalent sequence means that the sequence 
maintains activity sufficient for it to be used in an almost identical manner under identical 
conditions to the polypeptide having the full sequence represented by SEQ ID NO: 3 in actual 
forms of use which exploit this activity. It is clear that it is possible for a person skilled in the 
art to select and produce with no particular difficulty such an equivalent sequence, referring to 
the sequence represented by SEQ ID NO: 3. For example, within the amino acid sequence 
represented by SEQ ID NO: 3, at least 1, preferably 1 to 10, more preferably 1 to 5 amino 
acids may be deleted; at least 1, preferably 1 to 10, more preferably 1 to 5 amino acids may be 
added or inserted; or, at least 1, preferably 1 to 10, more preferably 1 to 5 amino acids may be 
substituted. Accordingly, the protein of the present invention includes a polypeptide having 
the amino acid sequence from position 2 to 744 of the amino acid sequence represented by 
SEQ ED NO: 3 in the Sequence Listing (one in which the methoinine at position 1 of the 
amino acid sequence represented by SEQ ID NO: 3 has been deleted.) 

Here, through the partial amino acid sequence analysis of the present invention and gene 
structure analysis, it became clear that 2 or more natural type subunits were produced by 
cleavage of a precursor polypeptide on the C terminal side of at least the histidine at position 
510 and the asparagine at position 627 in the amino acid sequence represented by SEQ ED 
NO: 3. 

2. Construction of a recombinant vector and transformant 

The present invention provides a DNA molecule comprising the gene of the present invention, 
in particular, an expression vector. This DNA molecule can be obtained by incorporating a 
DNA fragment encoding the novel endo- /3 -N-acetylglucosaminidase according to the present 
invention into a vector molecule. Accordingly, if transformation of a host cell is performed 
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with a DNA molecule, particularly in the form of an expression vector, which includes a DNA 
fragment encoding the novel endo- /3 -N-acetylglucosaminidase of the present invention in a 
form such that it is replicable within the host cell and the said gene is expressible, it is 
possible to cause production of the novel endo- /S -N-acetylglucosaminidase of the present 
invention in the host cell. 

The DNA molecule according to this invention can be constructed based on the method 
described in the above-referenced Molecular Cloning: A Laboratory Manual (supra). 

(1) Construction of a recombinant vector 

The vector to be used in the present invention may be appropriately selected from a virus, 
plasmid, cosmid vector or the like in consideration of the type of host cell to be used. 

For example, where the host cell is E.coli, bacteriophages of the X phage line, plasmids of 
the pBR line (pBR322, pBR325 etc.) and pUC line (pUC118, pUC119 etc); where the host 
cell is Bacillus subtilis, plasmids of the pUB line (pUBHO etc); and where the host cell is 
yeast, vectors of the YEp and YCp lines (e.g. YEpl3, YEp24, YCp50 etc.), or pG-3-Not used 
in the Examples below, may be used. Further, animal viruses such as a retrovirus or vaccinia 
virus, or insect virus vectors such baculovirus can be used. 

To introduce the gene of the present invention into the vector, methods for ligating the gene to 
the vector such as first cleaving the purified DNA with a suitable restriction enzyme, and then 
inserting the gene at suitable restriction sites or multicloning sites of the vector DNA, are 
employed. 

It is necessary that the gene of the present invention be incorporated into the vector such that 
the function of this gene is exhibited. Thus, it is preferable that the vector of the present 
invention includes a selective marker. Drug-resistant markers and auxotrophic markers can be 
used as selective markers. 
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Further, it is preferable that the DNA molecule to be used as the expression vector of the 
present invention has DNA sequences necessary for the expression of the novel endo- $ -N- 
acetylglucosaminidase gene such as transcription regulating signals and translation regulating 
signals such as, for example, a promoter, transcription initiation signal, ribosome binding site, 
translation stop signal and a transcription termination signal. 

(2) Construction of the transformant 

The transformant of the present invention can be obtained by introducing the recombinant 
vector of the present invention into a host in a manner allowing expression of the subject gene. 
There is no particular limitation on the host that may be used as long as it allows expression 
of the gene of the present invention. Examples include bacteria of the genus Escherichia such 
as Escherichia coli, of the genus Bacillus such as Bacillus subtilis, or of the genus 
Pseudomonas such as Pseodomonas putida etc., and yeasts such as Saccharomyces cerevisiae, 
Schizosaccharomyces pombe, Candida boidinii and Pichia pastoris. 

As host cells, apart from E.coli, B. subtilis, and yeast, animal cells such as COS cells and 
CHO cells etc., and insect cells such as Sf9 and Sf21 etc. may be used. 

Where a bacterium such as E.coli is used as a host, the recombinant vector of the present 
invention preferably is autonomously replicable within the bacterium and comprises a 
promoter, ribosome binding site, the gene of the present invention, and a transcriptional 
termination sequence. A gene which controls the promoter may also be included. 

Examples of E. coli include Escherichia coli K12, DH1, DH5a, JM109 etc. Examples of 
Bacillus subtilis include Bacillus subtilis MI 114, 207-21 etc. It is known that there exist 
strains of Bacillus subtilis that secrete proteins out of the microorganism body. There are also 
known strains that secrete hardly any protease. The use of such strains as hosts is preferable. 

As a promoter, a promoter within the inserted fragment that is able to function even in the 
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host, may of course be used. Examples of promoters in E.coli include lactose operon (lac) and 
tryptophan operon (trp), etc. 

As a method for introducing the recombinant vector into bacteria, any method for introducing 
DNA into bacteria may be employed and there is no particular limitation thereon. Examples 
include a method using calcium ions [Cohen, S.N. et al.: Proc. Natl. Acad. ScL, USA, 
69:2110-2114(1972)], and electroporation. 

Where yeast is to be the host, Saccharomyces cerevisiae, Schizosaccharomyces pombe, 
Candida utilis, etc., may be used. In this case, there is no particular limitation on the promoter 
that may be used as long as it can express in yeast. For example, promoters such as alcohol 
dehydrogenase (ADH), acidic phosphatase (PHO), galactose gene (GAL),and glyceraldehyde- 
3-phosphate dehydrogenase gene (GAPDH) promoters, heat shock protein promoter, MFal 
promoter, PGK promoter, GAP promoter and AOX1 promoter may preferably be used. 

As a method for introducing the recombinant vector into yeast, any method for introducing 
DNA into yeast may be employed and there is no particular limitation thereon. Examples 
include electroporation [Becker, D.M. et al.: Methods. Enzymol., 194: 182-187 (1990)], 
spheroplast method [Hinnen, A. et al.: Proc. Natl. Acad. Sci., USA, 75: 1929-1933 (1978)], 
and lithium acetate method [Itoh, H. : J. BacterioL, 153:163-168 (1983)]. 

Where an animal cell is to be the host, monkey cell COS-7, Vera, Chinese hamster ovary cell 
(CHO cell), mouse L cell, rat GH3 and human FL cell, etc. may be used. As a promoter, SR a 
promoter, SV40 promoter, LTR promoter, CMV promoter or the like can be used. Also, the 
initial promoter of human cytomegalovirus or the like may be used. 

Examples of a method for introducing the recombinant vector into an animal cell include 
electroporation, calcium phosphate method and lipofection method. 

Where an insect cell is to be the host, Sf9 cell, Sf21 cell and the like may be used. 
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Examples of a method for introducing the recombinant vector into an insect cell include 
calcium phosphate method, lipofection, and electroporation. 

4. Production of the protein of the present invention 

A protein of the present invention has an amino acid sequence encoded by a gene of the 
present invention, or has said amino acid sequence into which said modification of at least 1 
amino acid has been introduced, and has the activity of endo- jS -N-acetylglucosaminidase. 

The protein of the present invention can be obtained by culturing the above-mentioned 
transformant and collecting the protein from this culture product. "Culture product" refers to 
either the culture supernatant, the cultured cells or microbial cells, or disrupted cells or 
microbial cells. 

Either a natural medium or synthetic medium may be used as a medium for culturing 
transformants obtained with microorganisms such as E. coli or yeast as hosts, as long as it 
contains a carbon source, a nitrogen source, inorganic salts and the like, which are able to be 
assimilated by the microorganism and it may be used to efficiently perform a culture of the 
transformant. 

As carbon sources, carbohydrates such as glucose, fructose, sucrose and starch, organic acids 
such as acetic acid and propionic acid, and alcohols such as ethanol and propanol are used. 

As nitrogen sources, inorganic acids such as ammonia, ammonium chloride, ammonium 
sulfate, ammonium phosphate, organic acids such as ammonium acetate, or other nitrogen 
containing compounds, as well as peptone, meat extract, corn steep liquor or the like are used. 

As inorganic matter, potassium (I) phosphate, potassium (H) phosphate, magnesium 
phosphate, magnesium sulfate, sodium chloride, iron (I) sulfate, manganese sulfate, copper 
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sulfate and calcium carbonate, etc are used. 



Culturing is typically performed under aerobic conditions such as a shake culture or aeration- 
agitation culture, at 37°C at 12 to 72 hours. During culturing, pH is maintained at 4-7.5. 
Regulation of pH is performed using inorganic or organic acid or alkali solutions and the like. 

During culturing, antibiotics such as ampicillin and tetracycline may be added to the medium 
as required. 

When the microorganism transformed with an expression vector containing an inducible 
promoter is cultured, an inducer may be added to the culture as required. For example, when 
culturing a microorganism transformed with an expression vector which uses a Lac promoter, 
isopropyl- # -D-thiogalactoside (IPTG) and the like may be added to the culture, and when 
culturing a microorganism transformed with an expression vector using a trp promoter, indole 
acetic acid (IAA) may be added. 

As culture media for culturing a transformant obtained from an animal host cell, the generally 
used RPMI1640 medium, DMEM medium or these medium to which fetal calf serum and the 
like have been added, are used. 

Culturing is typically performed over 2 to 10 days at 37°C in the presence of 5% CO z . 
Antibiotics such as kanamycin and penicillin and the like may be added during culturing as 
required. 

After culturing, when the protein of the present invention has been produced within the 
microorganisms or cells, the protein of the present invention is extracted by disrupting the 
microorganisms or cells. When the protein of the present invention has been produced outside 
the microorganisms or cells, the culture fluid may be used as is, or the microorganisms or 
cells removed by centrifugal isolation, etc. 
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Purification of the recombinant novel endo- 0 -N-acetylglucosaminidase, is performed by the 
suitable combination of known separation and purification methods. Examples of such 
methods include methods using differences in solubility such as salt precipitation, and solvent 
precipitation methods; methods using differences in molecular weight such as dialysis, 
ultrafiltration, gel filtration and SDS-polyacryl electrophoresis; methods using differences in 
electrostatic charge (valence) such as ion exchange chromatography; methods using 
differences in hydrophobicity such as hydrophobic chromatography and reverse phase 
chromatography; and, methods using differences in isoelectric point such as isoelectric 
focusing. 

In the present invention, as indicated in the Examples below, when this gene was made to 
express under the control of GAPDH promoter, in a Saccharomyces cerevisiae host, high 
enzyme activity was confirmed within the cell extract. This indicated that it was possible to 
produce active novel endo- 0 -N-acetylglucosaminidase in large quantities through the 
expression of the gene of the present invention in the recombinant. 

Brief Description of Drawings 

Figure 1 is an electrophoresis photograph indicating results for purified endo- 0 -N- 
acetylglucosaminidase. 

Figure 2 is a restriction enzyme map of pZL-Endo which comprises the full length sequence 
of a novel endo- 0 -N-acetylglucosaminidase gene. 

Figure 3 indicates the entire nucleotide sequence of the fragment inserted at the Sal I-Not I 
sites of pZL-Endo which comprises the full length sequence of a novel endo- 0 -N- 
acetylglucosaminidase gene. 

Figure 4 indicates the entire nucleotide sequence of the fragment inserted at the Sal I-Not I 
sites of pZL-Endo which comprises the full length sequence of a novel endo- 0 -N- 
acetylglucosaminidase gene. (Continuation of Figure 3) 

Figure 5 shows the deduced amino acid sequence from novel endo- 0 -N- 
acetylglucosaminidase gene and the nucleotide sequence of a DNA encoding said amino acid 
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sequence. 

Figure 6 shows the deduced amino acid sequence from novel endo- 0 -N- 
acetylglucosaminidase gene and the nucleotide sequence of a DNA encoding said amino acid 
sequence. (Continuation of Figure 5) 

Figure 7 shows the deduced amino acid sequence from novel endo- # -N- 
acetylglucosaminidase gene and the nucleotide sequence of a DNA encoding said amino acid 
sequence. (Continuation of Figure 6) 

Figure 8 shows the structure of expression vector pGEndo-SC , which comprises a novel endo- 
/3 -N-acetylglucosaminidase gene for use in Saccharomyces cerevisiae. 

Figure 9 is a chromatography photograph which indicates the expression of novel endo- 0 -N- 
acetylglucosaminidase in yeast into which said enzyme gene has been introduced. 

Best Mode for Carrying Out the Invention 

Below, the present invention will be described more specifically using examples. It is 
contemplated, however, that the technical scope of the present invention is not to be limited to 
these examples. Unless otherwise indicated herein the procedures were performed in 
accordance with the methods as are described in Molecular Cloning: A Laboratory Manual 
(Sambrook, Maniatis et al, Cold Spring Harbour Laboratory Press (1989)). 

Example 1: Measurement of Enzyme Activity 

Measurement of the activity of endo- $ -N-acetylglucosaminidase was performed 
fundamentally in accordance with the method indicated in S. Kadowaki, et al., Agric. Biol. 
Chem., 54, 97 (1990). That is, the reaction was performed using dansylated human asialo- 
transferrin glycopeptide (DNS-GP) as a substrate in a potassium phosphate buffer (pH6.0) at 
37°C and activity measured by thin layer chromatography (TLC) or by HPLC under the 
following conditions. 
Analysis conditions for TLC: 

Development phase: HPTLC silica gel 60 (Merck) 

Solvent: butanol: acetic acid: water = 2:1:1 
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Detection: detection by fluorescence method 
Analysis conditions for HPLC: 

Column: TSK-gel ODS80TM (TOSO) 

Solvent: 25mM sodium borate buffer (pH 7.5) +11% acetonitrile 
Column temperature: 50°C 
Flow rate: 0.5ml/minute 
Detector: fluorescence detector 

Activity was defined such that 1 unit is equivalent to the amount of enzyme required to 
produce 1 jet mol of dansylated asparagyl acetylglucosamine in one minute under the above 
described HPLC measurement conditions. 

Example 2: Culturing Mucor hiemalis 

A medium (galactose 2%, yeast extract 3%) 100ml was placed in a 500ml Sakaguchi flask, 
and inoculated with a third to a fifth of a slant of Mucor heimalis spore, and culture was 
performed at 28°C for 2 days. The microbial cells separated from the culture by suction 
filtration were used in the preparation of mRNA. 

Regarding preparation of the enzyme, the above culture, following cultivation, was 
transferred to 2 liter of the medium in a 3 liter jar fermenter, and culturing was performed for 
four days under conditions of 28°C, rotation speed of 300-400rpm, and aeration volume of 2 
liters per minute. 

Example 3: Purification of novel endo- /3 -N-acetylglucosaminidase 

Four liters of the culture (from two batches of cultures performed in 3-liter jar fermenter) 
obtained in Example 2 was subjected to suction filtration to separate the cells, and 
concentrated to 200ml using ultrafiltration. This crude enzyme solution was subjected to ion 
exchange chromatography (Pharmacia Q Sepharose FF, 500ml) equilibrated with lOmM 
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potassium phosphate buffer containing 5mM EDTA. The column was washed with the same 
buffer solution followed by elution of endo- j3 -N-acetylglucosaminidase with a linear 
gradient of 900ml 0M-0.3M NaCl. Reagents were added to the active fraction such that the 
final concentrations of 1M ammonium sulfate, 50mM potassium phosphate containing 5mM 
EDTA were attained. The product was subjected to hydrophobic chromatography (TOSO 
Phenyl-TOYOPEARL 650S 200ml) equilibrated with the same buffer. The column was 
washed with this same buffer and then endo- B -N-acetylglucosaminidase was eluted with a 
linear gradient of 1M-0M ammonium sulfate containing ImM EDTA(600ml). 

The obtained eluate was concentrated to 5ml with ultrafiltration membrane (molecular weight 
cut-off 13000) then washed and desalted with lOmM potassium phosphate buffer containing 
ImM EDTA and 0.15M NaCl (pH7.0). Next, the active fraction was applied to a gel filtration 
chromatography column (Pharmacia Sephacryl S300) equilibrated with the same buffer 
solution, and endo- jS -N-acetylglucosaminidase was eluted with the same buffer solution. 

The active fraction was concentrated with ultrafiltration membrane (molecular weight cut-off 
13000) then washed and desalted with lOmM potassium phosphate containing ImM EDTA 
(pH7.0). Next, the active fraction was subjected to hydroxy apatite chromatography (TOSO 
TSK-gel HA1000) equilibrated with the same buffer. The column was washed with this same 
buffer and then endo- @ -N-acetylglucosaminidase was eluted with a linear gradient of 0M- 
0.3M ammonium sulfate containing ImM of EDTA (pH7.0)(30ml). 

The active fraction was concentrated with ultrafiltration membrane (molecular weight cut-off 
13000) then washed and desalted with a 25mM bis-tris buffer solution adjusted to pH 7.1 with 
iminodiacetic acid. The active fraction was subjected to isoelectric point chromatography 
(Pharmacia, MonoP) with equilibrated with the same buffer solution. The column was washed 
with this same buffer and then endo- jS -N-acetylglucosaminidase was eluted with 50ml of 
10% Polybuffer 74 (Pharmacia) adjusted to pH 3.9 with iminodiacetic acid. 

The active fraction was concentrated with ultrafiltration membrane (molecular weight cut-off 
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13000) then washed and desalted with lOmM potassium phosphate buffer containing ImM 
EDTA (pH7.0). Next, the active fraction was subjected to ion exchange chromatography 
(Pharmacia, MonoQ). The column was washed with this same buffer and then endo-/3 -N- 
acetylglucosaminidase was eluted with a linear gradient of 0M-0.3M NaCl (30ml). 

The active fraction was concentrated with ultrafiltration membrane (molecular weight cut-off 
13000) then washed and desalted with 50mM potassium phosphate buffer containing ImM 
EDTA (pH7.0), to obtain an enzyme sample. It should be noted that each column 
chromatography was performed using FPLC (Pharmacia). 

The mass of the protein was measured using Protein Assay Kit manufactured by BioRad, or 
by absorbance (280nm). The molecular weight and isoelectric point of the protein was 
measured by SDS-PAGE (15-25% gradient), gel filtration chromatography, IEF-PAGE and 
the like. 

From the activity of each of the fractions in two-dimensional electrophoresis by Native- 
SDSPAGE and IEF-SDSPAGE, and in the above chromatography, and from the results of 
SDS-PAGE analysis, bands of at least 60kDa (denoted p60) and 14kDa (denoted pl4) on 
SDS-PAGE were detected (Fig. 1). 

Example 4: Determination of a partial amino acid sequence of novel endo- /3 -N- 
acetylglucosaminidase 

Partial amino acid sequence analysis was performed according to the Iwamatsu method 
(Seikagaku (Biochemistry) 63, 139-143 (1991)). The purified enzyme was suspended in an 
electrophoresis buffer solution (10% glycerol, 2.5% SDS, 2% 2-mercaptoethanol, 62mM Tris 
HC1 buffer (pH6.8)), and was subjected to SDS polyacrylamide electrophoresis. After 
electrophoresis, said enzyme was transferred from the gel to 10cm by 7cm PVDF membrane 
((ProBlot) Applied Biosystems) by electroblotting. Using ZARUTO Blot Type Us 
(Zarutorius) as an electroblotting device, electroblotting was preformed at 160mA for 1 hour. 
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After transfer, the portion of the membrane onto which the said enzyme was transferred was 
cut out and part of this portion was directly analyzed with a gas phase protein sequencer 
whereby the N-terminal amino acid sequence was determined. Further, the remaining 
membrane was soaked in 300 # 1 of a reducing buffer solution (8M guanidine-HCl, 0.5M Tris- 
HC1 buffer (pH8.5), 0.3%EDTA, 2% acetonitrile), lmg of ditiothreitol (DTT) was added, and 
reduction was performed for about 1 hour at 25°C in the presence of argon. To this, 3.0mg of 
monoiodoacetic acid dissolved in 10 #1 of 0.5N sodium hydroxide solution was added and 
was stirred for 20 minutes under darkened conditions. PVDF membrane was taken out, and 
after thorough enough washing with 2% acetonitrile, it was soaked in lOOmM acetic acid 
containing 0.5% polyvinylpyrrolidone-40, and allowed to stand for 30 minutes. Afterward, 
the PVDF membrane was washed thoroughly with water. A 1mm square was cut from the 
membrane and soaked with a digestion buffer (8% acetonitrile, 90mM Tris-HCl buffer pH 
9.0) and 1 pmol of Acromobacter protease I (Wako Pure Chemicals Industries) was added 
thereto. The enzyme was then digested for 15 hours at room temperature. The digestion 
product was separated by reversed phase high performance liquid chromatography on C18 
column (Wakosil AR n C18 300 A 2.0 x 150mm(Wako Pure Chemicals Industries)), and 
seven peptide fragments were obtained in respect of each subunit. 

As elution solvents for the peptides, solvent A (0.05% trifluoroacetic acid) and solvent B (2- 
propanol/acetonitrile solution 7:3, containing 0.02% trifluoroacetic acid) were used, and the 
elution was performed for 40 minutes with a linear concentration gradient from 2 to 50% of 
solvent B, at a rate of 0.25mIVmin. 

Amino acid sequences of fragmented peptides obtained from the novel endo- /3 -N- 
acetylglucosaminidase candidate protein were analyzed. Fragments derived from p60 and 
fragments derived from pi 4, were denoted as p60-AP and pl4-AP, respectively. Sequencing 
of the obtained fragmented peptides was carried out using a gas phase protein sequencer 
PPSO-10 (Shimadzu) by the Edman degradation method in accordance with the manual. 



23 



The obtained partial amino acid sequences are indicated in Table 1. 



Table 1 - Partial amino acid sequence of endo- /? -N-acetylglucosaminidase candidate protein 



P 60 

P 60-AP-5 
17) 

p60-AP-6 
p60-AP-8 
p60-AP-9 
20) 

p60-AP-ll 
21) 

pl4 

pl4-AP-l 



PSLQLQPDDK 

(K) SYRNPEIYPtDQNIK 
(K) FNVSSVALQPRVK 
(K) MDRLFLCGgK 

S 

(K) GQRFNHREShDVETEI 

mal p lilt 
(K) EGYIS SSGSIDLSLN 



(SEQ ID NO: 

(SEQIDNO: 18) 
(SEQ ID NO: 19) 

(SEQ ID NO: 



(SEQ ID NO: 



(SEQ ID NO: 22) 



In the amino acid sequence described in Table 1, amino acids indicated by lower case letters 
of the alphabet are undetermined amino acids within the amino acid sequence. 

Since Acromobacter protease I, used on the partial amino acid sequence, specifically cleaves 
on the carboxyl side of a lysine residue, K (lysine) has been written in brackets on the N- 
terminus side of the sequences below. 

Since the p60-AP-5 was found to be the N-terminus amino acid sequence, K (Lysinse) in 
brackets was removed. 

In respect of the Acromobacter protease I digestion products of p60 and pl4, molecular 
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weight analysis was also performed with on-line mass analyzer (PE Sciex API-HI), combined 
with reverse phase high performance liquid chromatography (Hitachi L6200) on C-18 column 
(GL Science Inertsil ODS-3 0.5 x 40mm). The results of analysis are shown in Table 2. 

Table 2 Results of /Lys-C digest product on the 60kDa peptide (p60) and the 14kDa 



peptide (pi 4) by LC/MS analysis 



?60 


Measured 
Value 


Ideal 
Value 


Error 
Margin 


Corresponding 
Sequence 


SEQ 

ID 

NO: 


AP-1 
\P-2 
\P-3 
AP-4 
AP-5 

AP-6 
AP-7 

AP-8 
AP-9 

AP-1 
AP-1 


0: 
1: 


950.50 
1160.50 

733.25 
1838.50 
1141.00 
1157.50 
1774.75 

701.50 
1544.50 
1621.00 
1444.75 

945.75 
2655.00 
2206.75 
2335.00 


950.47 
1160.56 

733.41 
1837.91 
1140.59 
1556.75 
1774.94 

701.39 
1543.79 
1620.73 
1444.83 

945.58 
2655.33 
2206.11 
2335.16 


+0.03 
-0.06 
-0.16 
+0.59 
+0.41 
+0.75 
-0.19 
+0.11 
+0.71 
+0.27 
-0.08 
+0.17 
-0.33 
+0.64 
-0.16 


(K)NIQGNNYK 

(K)YSDYPPPPPK 

(K)LSLDASK 

(K)SYRNPEIYPTDQNIK 

( )PSLQLQPDDK....p60 N-terminus 

(K)NTDGIFLNYWWK 

(K)GB*SLRYIYRTLLMK 

(K)LTVAB*H p60 C-terminus 

(K)PQLLLTHDMAGGYK 
(K) SMNELRDWTPDEK 
(K)FNVS SVALQPRVK 
(K)LAPVSFALK 

(K)GQRFNHRESHDVETEISIPLYK 

(K)ITSSLDB*DHGAFLGGTSL1IK 

(K)NELFFKNTDGIFLNYWWK 


23 
24 
25 
18 
17 
26 
27 

29 
30 
19 
31 
32 
33 
34 


d14 


Measured 
Value 


Ideal 
Value 


Error 
Margin 


Corresponding 
Sequence 


SEQ 

ID 

NO: 


AP-1: 
AP-2: 
AP-3: 


888.75 
1392.50 
1541.50 
1608.50 


888.45 
1392.76 
1541.73 
1608.84 


+0.30 
-0.26 
-0.23 
-0.34 


(K)IVIEAVNK 

( )SSRIIQDLFWK pl4 N-terminus 

(K)EGYISSSGSIDLSLN..pl4 C-terminus 
1 (K)TDSSRIIQDLFWK 


35 
36 
22 
37 



* B represents cysteine, carboxymethyl. 



In Table 2, the fragment having a measured value of 701.50 in respect of mass (M+H+) is 
p60-AP-7, and the fragment having a measured value of 1541.50 is nearly identical to the 
molecular weight of pl4-AP-3, and it was found that the amino acid at this C-terminus was 
not K (lysine). Regarding the fragments digested with Acromobacter protease I, since, due to 
the substrate specificity of this enzyme, fragments other than C-terminus fragment of the 
subunit itself, will have a C-terminus amino acid residue that is lysine (K), it is clear that 
these fragmented peptides were the C-terminus fragments of the subunits p60 and pl4. 
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A homology search on the determined p60 and pl4 partial amino acid sequences using the 
protein database BLASTP indicated that the obtained sequences were novel. From the above 
results, gene cloning of p60 and pl4 was performed selecting the p60 and pl4 as endo- $ -N- 
acetylglucosaminidase candidates. 



Example 5: Construction of a Mucor hiemalis strain cDNA library. 



Firstly, total RNA was extracted from 5g of the microorganisms obtained in Example 2 using 
ISOGEN (Nippon Gene, Inc.). From the extracted total RNA, mRNA was purified using an 
mRNA Purification Kit (Pharmacia Biotech). cDNA was synthesized from the mRNA using a 
Superscript™ Lambda System for cDNA Synthesis and X Cloning Kit (GIBCO BRL), and 
then ligated to a Sal I adapter, and finally linked (or ligated) to X ZipLox™Sal I-Not I Arms 
(GIBCO BRL). Packaging was performed using Gigapack HI Gold Packaging Extract 
(Stratagene), for infection of E.coli Y1090 strain thereby constructing the cDNA library. 

Example 6: Cloning of novel endo- $ -N-acetylglucosaminidase cDNA. 

PCR primers were designed based on the partial amino acid sequences p60-AP-5, p60-AP-6, 
p60-AP-ll. These sequences are indicated below. The symbols used herein are all based on 
IUPAC-IUB. 



p60-AP-5 

p60-AP-5F 

p60-AP-6 

p60-AP-6F 

p60-AP-6R 

P 60-AP-11 

P 60-AP-11R 



5' CARTTRCARCCNGAYGAYAA 3' (sense primer) (SEQ ID NO: 5) 

5' CCHACNGAYCARAAYATYAA 3' (sense primer) (SEQ ID NO: 7) 
3' GGDTGNCTRGTYTTRTARTT 5' (antisense primer) (SEQ ID NO: 8) 

3' TTYCCDGTYGCDAARTTRGT 5' (antisense primer) (SEQ ID NO: 10) 
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Genomic DNA was purified from Mucor hiemalis culture mass by the phenol method and 
when genomic PCR (30 seconds at 94°C, 1 minute at 55°C,1 minute at 72°C, 30 cycles) was 
performed, specific amplified bands were confirmed. Regarding p60, with a combination of 
p60-AP-5F and p60-AP-HR primers, a PCR fragment of 1.7kb, with combination of p60-AP- 
5F and p60-AP-6R, a PCR fragment of 1.5kb, and with a combination of p60-AP-6F and p60- 
AP-11R primers, a PCR fragment of 0.2kb was obtained. Regarding this fragment, subcloning 
into pCR-Script Amp was performed using a pCR-Script cloning kit (Stratagene). It was 
predicted from analysis by restriction enzyme digestion, that the amplification fragment of 
p60-AP-5F and p60-AP-HR included the amplification fragments of p60-AP-5F and p60-AP- 
6R, and p60-AP-6F and p60-AP-HR. Thus, the nucleotide sequence of the amplification 
fragment of p60-AP-5F and p60-AP-HR was determined using PRISM Ready Reaction kit 
(Applied Biosystems) and PRISM 3 7 7 DNA sequencer (Applied Biosystems). Gene 
analysis was performed using DNASIS (Hitachi Software Engineering Co., Ltd.) etc. 

As a result, the amplification fragment of p60-AP-5F and p60-AP-HR included the 
determined other partial amino acid sequence. Therefore, it was determined that this fragment 
was a part of the p60 gene. Thus, new DNA primers were constructed based on the inside 
sequence of the PCR amplification fragment, and RT-PCR was then performed (under the 
same conditions as genomic PCR), using Access RT-PCR System (Promega) and with the 
mRNA obtained in Example 5 as a template. The sequences of the newly constructed DNA 
primers are as follows: 

p60-AP-5NF 5' CACTTAAGTCTATGAATGAG 3' (sense primer) (SEQ ID NO: 13) 
p60-AP-6NR 3' CGATAGCTTTAGGTCTCTAA 5' (antisense primer) (SEQ ID NO: 14) 

As a result, a fragment of approximately 1.2kb was amplified. Upon sequencing the amplified 
fragment it was found that a fragment not including introns had been obtained. Therefore, 
cDNA cloning was performed using this fragment as a probe. Labeling was performed with 
a - 32 P dCTP (HOTBq/mmol) using Megaprime DNA labeling systems (Amersham). 
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The full-length gene was obtained from the cDNA library, obtained in Example 5, by plaque 
hybridization. As a result, 5 positive clones were obtained from 200,000 plaques. Of these, 4 
clones were subjected to secondary screening thereby obtaining a single plaque. Further, 
E.coli DH10B was infected with phage fluid obtained from the plaque, and a plasmid derived 
from pZLl was recovered from the phage. Restriction enzyme analysis was performed in 
respect of these clones, and nucleotide sequence analysis of the clone comprising the longest 
upstream region was performed. This plasmid was named pZL-Endo. (Fig. 2) 

The nucleotide sequence of the inserted Sal I-Not I fragment of about 2.3kb was determined. 
In other words, the restricted fragments were subcloned into pBluescript II KS+ (Stratagene) 
or pUC118 (Takara Shuzo), and by producing a continuous deletion mutant using exonuclease 
III and mung bean nuclease, a plasmid having a variety of deletion mutations was constructed, 
and the sequence of a Sal I-Not I fragment consisting of 2370bp was determined using a DNA 
sequencer. (Figs. 3-4, SEQ ID NO: 1) 

Analysis of the predicted region of the structural gene was performed and it was found that 
there existed an open reading frame encoding an amino acid sequence consisting from 744 
amino acid residues (deduced molecular weight 85kDa, Figs 5-7, SEQ ID NO: 2), and this 
amino acid sequence comprised all of the determined partial amino acid sequences of p60 and 
pl4. Since the amino acid next to p60-AP-5 on the N-terminus side was not lysine but 
methionine, it was confirmed that ATG encoding this methionine was the start codon for 
translation. It had therefore been clarified that the N-terminus of the enzyme of the present 
invention was proline. 

On the other hand, as with the result of qualitative analysis, it was found that pl4-AP-3 was 
the C-terminus of the protein encoded by the gene of the present invention. Further, together 
with the results of mass analysis, it was predicted that at least one type of pl4 N-terminus was 
serine, the amino acid at position 628 in the amino acid sequence indicated in SEQ ID NO: 2. 

From the above, it was clear that the 5' region of the gene of the present invention encodes 
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p60, and the 3' region encodes pl4. Since the N-terminal signal sequence could not be found 
in the amino acid sequence, the enzyme of the present invention is thought to be an 
intracellular protein. However, since there exists a plurality of bands in Figure 1, the enzyme 
of the present invention is thought to be affected by action of a protease due to the lysis of the 
cell. 

Example 7: Construction of the endo- /3 -N-acetylglucosaminidase gene expression vector 

In this Example, an expression vector for Saccharomyces cerevisiae, complementary to a 
TRP1 gene, including an endo- $ -N-acetylglucosaminidase gene and a GAPDH gene 
promoter-PGK terminator was constructed. 

To obtain the open reading frame encoding the 744 amino acids confirmed in Example 3, 
DNA primers based on DNA sequences equivalent to the amino acid sequences of the N- 
terminus and C-terminus to which a Not I site had been added at both termini, were 
synthesized and PCR was performed with pZL-Endo as a template thereby obtaining an 
amplified fragment. The following are the sequences of the sense and antisense primers: 
Endo-Not-F (sense primer) 

5' GGGGCGGCCGCTTTTATTTTACATAAATATGCCTTCACTTC 3' (SEQ ID NO: 

15) 

Endo-Not-R (antisense primer) 

5' CCCGCGGCCGCCTAGTTTAATGACAAATCTATGCTACC 3' (SEQ ID NO: 16) 

After separation by agarose gel electrophoresis, the amplified fragment was recovered and 
purified using Prep-A-Gene DNA Purification System (Bio-Rad). Further, after digestion of 
this fragment with Not I, it was purified and inserted into the Not I of pBluescript II KS+ 
thereby producing pBlue-Endo-Not. 

Since the novel endo- & -N-acetylglucosaminidase gene is the gene derived from mold, it was 
thought that expression in yeast was suitable. An expression plasmid for Saccharomyces 
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cerevisiae with trpl gene as a selective marker, said trp 1 gene comprising a Saccharomyces 
cerevisiae glyceraldehyde-3-phosphate dehydrogenase gene (GAPDH) promoter, a 3- 
phophoglycerate kinase (PGK) gene terminator, and a tryptophan synthesis gene TRP1 gene, 
was constructed with expression vector pG-3 (Methods in Enzymology Vol. 194 p. 389). 
Said pG-3-Not was constructed by digesting pG-3 with BamHI, blunt-ending by Klenow 
processing followed by addition of a Not I linker. 

The above-mentioned pBlue-Endo-Not was digested with Not I and an insertion fragment of 
approximately 2.3kb separated and purified by agarose gel electrophoresis. This fragment was 
inserted at the Not I site of pG-3-Not thereby constructing pGEndo-SC. (Fig. 8) 

Example 8: Expression of novel endo- /3 -N-acetylglucosaminidase in Saccharomyces 
cerevisiae. 

A pep4 gene disrupted strain of the yeast Saccharomyces cerevisiae YPH500 strain 
(Stratagene) was used as a host. A pep4 gene disrupted strain was produced by the method of 
Sikorski, R.S. and Hieter, P (Genetics Vol. 122 19-27 (1989)). The said strain was 
transformed with 10 A g of pGEndo-SC. Transformation was performed by the lithium acetate 
method (see W095/32289) and transformants were selected on a culture plate not including 
tryptophan (yeast nitrogen base 0.67%, casamino acid 0.5%, glucose 1%). 

Confirmation of the activity of intracellular novel endo- /3 -N-acetylglucosaminidase was 
performed in respect of the transformants. For the transformed cells cultured for 2 days at 
30°C in a 5mL YPD medium (yeast extract 1 %, polypeptone 2%, glucose 2%), centrifugation 
was performed at 1500g for 5 minutes at 4°C, thereby separating the supernatant and the cell 
mass. The cell mass was washed with distilled water. Ten microliters of a mixture of 50mM 
potassium sulfate buffer (pH 6.0) and 5mM EDTA was added to the cell mass and suspended 
well. Further, 50mg of glass beads was added and after vigorous stirring, centrifuged, and the 
supernatant taken was as the cell extract. 
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Measurement of activity was performed by TLC or HPLC using DNS-GP as a substrate. 
Results with TLC are indicated in Figure 9. As with the sample reacted with the enzyme 
purified from the supernatant of the Mucor hiemalis culture, a peak identical with the 
dansylated asparagyl acetylglucosamine (DNS-Asn-GlcNAc) was obtained from the pGEndo- 
SC product. On the other hand, no peak corresponding to DNS-Asn-GlcNAc was detected 
from the culture supernatant of the negative control strain transformed with pG-3-Not. The 
pGEndo-SC cell extract was concentrated by a factor of ten, desalted, reacted with DNS-GP 
and the peak corresponding to DNS-Asn-GlcNAc was fractionated using HPLC under the 
above-mentioned conditions. The fractionated samples were concentrated with an evaporator, 
and mass spectrometry analysis was performed thereon. As a result, it was confirmed that the 
mass spectrometry results matched with the DNS-Asn-GlcNAc results. Thus it was apparent 
that the gene product encoded by the pGEndo-SC insertion fragment was a novel endo- @ -N- 
acetylglucosaminidase. 

Table 3 indicates the activity (product amount) per liter of the culture of the novel endo- 0 -N- 
acetylglucosaminidase of this invention. This activity was 48 times the value for Mucor 
hiemalis. 



TABLE 3 Novel endo- /3 -N-acetylglucosaminidase activity 



Activity (Unit/Liter) 


M. hiemalis culture supernatant 


0.9 


Culture fluid of S. cerevisiae into which 


43.2 


novel endo- 0 -N-acetylglucosaminidase gene 




was introduced* 





*After cell collection from the culture, the cells were disrupted with glass beads. After 
separation of the supernatant using a centrifuge, the activity of the supernatant was measured, 
and the activity per culture volume was calculated from this value. 

This specification incorporates the content described in the specification and/or the drawings 
of Japanese Patent Application No. 10-141717 which is the application upon which priority 
for the present application is based. 
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All publications, patents and patent applications cited herein are incorporated herein by 
reference in their entirety. 

Industrial Applicability 

The present invention provides an endo- jS -N-acetylglucosaminidase, an endo- & -N- 
acetylglucosaminidase gene, a recombinant plasmid which comprises said gene, an organism 
transformed with said plasmid, and a method of producing endo- /3 -N-acetylglucosaminidase. 

By introducing a vector which comprises the gene of the present invention into a host, and 
causing the gene to be expressed, endo- jS -N-acetylglucosaminidase can be produced 
efficiently and in large quantities. 

The enzyme of the present invention is an industrially important enzyme in the analysis and 
assay of sugar chains and the modification of sugar chains. The transformants obtained by the 
present invention produce the subject enzyme in large quantities and make a large 
contribution to industries that employ these enzymes. 

Free Text of Sequence Listing 

SEQ ED NO: 4: A partial amino acid sequence of endo- /? -N-acetylglucosaminidase 

SEQ ID NO: 5: An oligonucleotide designed from a partial amino acid sequence of endo-/3 - 

N-acetylglucosaminidase 

SEQ ID NO: 5: n indicates a, g, c, or t (Location: 12) 

SEQ ID NO: 6: A partial amino acid sequence of endo- $ -N-acetylglucosaminidase 

SEQ ID NO: 7: An oligonucleotide designed from a partial amino acid sequence of endo- /3 - 

N-acetylglucosaminidase 

SEQ ID NO: 7: n indicates a, g, c, or t (Location: 6) 

SEQ ID NO: 8: An oligonucleotide designed from a partial amino acid sequence of endo-/3 - 
N-acetylglucosaminidase 
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SEQ ID NO: 8: n indicates a, g, c, or t (Location: 15) 

SEQ ID NO: 9: A partial amino acid sequence of endo- & -N-acetylglucosaminidase 

SEQ ID NO: 10: An oligonucleotide designed from a partial amino acid sequence of 

endo- j3 -N-acetylglucosaminidase 

SEQ ID NO: 11: An oligonucleotide sequence of the 5' terminal region of an endo- 

j8 -N-acetylglucosaminidase gene 

SEQ ID NO: 12: An oligonucleotide sequence of the 3' terminal region of an endo- 

# -N-acetylglucosaminidase gene 

SEQ ID NO: 13: An oligonucleotide designed from the nucleotide sequence of an 

endo- /3 -N-acetylglucosaminidase gene 

SEQ ID NO: 14: An oligonucleotide designed from the nucleotide sequence of an 

endo- & -N-acetylglucosaminidase gene 

SEQ ID NO: 15: An oligonucleotide designed from the nucleotide sequence of an 

endo- /S -N-acetylglucosaminidase gene 

SEQ ID NO: 16: An oligonucleotide designed from the nucleotide sequence of an 

endo- /3 -N-acetylglucosaminidase gene 

SEQ ID NO: 20: Xaa indicates Met or Ser (Location: 2) 

SEQ ED NO: 21 : Xaa indicates Gly or Met (Location: 2) 

SEQ ID NO: 21 : Xaa indicates Gin or Ala (Location: 3) 

SEQ ID NO: 21 : Xaa indicates Arg or Leu (Location: 4) 

SEQ ID NO: 21 : Xaa indicates Asn or Pro (Location: 6) 

SEQ ID NO: 21 : Xaa indicates Arg or Leu (Location: 8) 

SEQ ID NO: 21: Xaa indicates Glu or Leu (Location: 9) 

SEQ ID NO: 21 : Xaa indicates Ser or Leu (Location: 10) 

SEQ ID NO: 21 : Xaa indicates His or Thr (Location: 11) 

SEQ ID NO: 27: carboxymethylcystein (Location: 3) 

SEQ ID NO: 28: carboxymethylcystein (Location: 6) 

SEQ ID NO: 33: carboxymethylcystein (Location: 8) 
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CLAIMS 



1. A recombinant protein selected from the group consisting of: 

(a) a protein comprising the amino acid sequence represented by SEQ ID NO: 3; 
and, 

(b) a protein comprising an amino acid sequence derived from the amino acid 
sequence represented by SEQ ID NO: 3 by deletion, substitution, insertion, or 
addition of at least one amino acid and having the activity of endo- /3 -N- 
acetylglucosaminidase. 

2. An endo- /5 -N-acetylglucosaminidase gene encoding: 

(a) a protein having the amino acid sequence represented by SEQ ID NO: 3; or, 

(b) a protein having an amino acid sequence represented by SEQ ID NO: 3 by 
deletion, substitution, insertion, or addition of at least one amino acid and having 
the activity of endo- j9 -N-acetylglucosaminidase. 

3. A gene comprising the following DNA: 

(c) a DNA consisting of a nucleotide sequence represented by SEQ ID NO: 2; or, 

(d) a DNA which hybridizes under stringent conditions with a DNA consisting of a 
nucleotide sequence represented by SEQ ID NO: 2, and which encodes a protein 
having endo- $ -N-acetylglucosaminidase activity. 

4. A gene which hybridizes under stringent conditions with the gene according to claim 2, and 
which comprises DNA encoding a protein having endo- /3 -N-acetylglucosaminidase 
activity. 

5. The gene according to any one of claims 2 to 4, wherein the gene is derived from a 
microorganism belonging to the genus Mucor. 

6. The gene according to claim 5, wherein the microorganism belonging to the genus Mucor is 
Mucor hiemalis. 

7. A recombinant vector which comprises the gene according to any one of claims 2 to 6. 

8. A transformant which comprises the recombinant vector of claim 7. 

9. A method of producing endo- jS -N-acetylglucosaminidase comprising culturing the 
transformant according to claim 8 and collecting endo- /3 -N-acetylglucosaminidase from 
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the culture product. 
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09/700993 



FIG. 1 



1 2 

mm 97kDa 

- 66kDa 
60kDa(p60) jiff | i 

M 45kDa 



31kDa 



21kDa 



14kDa(pl4) *-*14kDa 



Purification Result for Endo- 0 -N-acetylglucosaminidase (15-25% gradient SDS-PAGE) 

Lane 1: Purified endo- /5 -N-acetylglucosaminidase from Mucor hiemalis 
Lane 2: Molecular weight markers 
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p- | Q 2 Novel endo- jS -N-acetylglucosaminidase gene 



Not I Pvu II Hind III Bell Sac I EcoRI EcoT22I Sail 




pZL-Endo 



Restriction enzyme map for pZL-Endo including the full-length 
novel endo- /? -N-acetylglucosaminidase gene. 
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FIG. 3 

10 20 30 40 50 60 

GTCGACCCAC GCGTCCGCGG ACGCGTGGGC GGACCCGTGG GCGGACGCGT GGGTnTATT 

70 80 90 100 110 120 

TTACATAAAT ATCCCTTCAC TTCAATTCCA. ACCTGATGAC AAACTAGCAC aamTlT 

130 140 150 160 170 180 

TOCACTTAAG TCTATCAATG AGTTGAGGGA CTGGACGCCA GACGAAAAGA TAAAGTTTAA 

190 200 210 220 230 .240 

CGTITCAAGC GTQGCACTAC AGCCTCGTGT GAAAAACGCC CTGAAACCTC AATTATTGTT 

250 260 270 280 290 300 

AACTCATCAT ATGGCAGGAG GATATAAAGA AGATAAAAAT ATTCAAGGAA ACAATTATAA 

310 320 330 340 350 360 

AGACATITAT AACATTCAAT ATTGGCATTT AGCTGATACT TTTGTATA1T TCTCTCATGA 

370 380 390 400 410 420 

GCGAGTEAGC ATTCCTCCAG TCAATTGGAC AAATGCTTGT CATAGAAATG GTGTAAAGTG 

430 440 450 460 470 480 

TTTAGGTACT TTITEAGTAG AAGGAAATAA CCAAATGCAT GAAATGGAAG CCTTGCTTCA 

490 500 510 520 530 540 

CGGTCCACCT TTACTTAATA ACACTGACGA CCCTATGAGA TTATGGAGTC CGTATTATGC 

550 560 570 580 590 600 

AGACCAATTA GOTGCTATTG CTAAACACTA TGGTrTTGAT GGCTGGTTGT TCAATATTGA 

610 620 630 640 650 660 

ATCCGAATTC TTTCCnTTC CTACAAATCC AAAATTCAAA GCTGAAGAGT TGGCAAAGTT 

670 680 690 700 710 720 

TCTACACXAT TTEAAGGAAA AATTGCATAA CGAAATACCT GGATCTCAAC TCATTTGGTA 

730 740 750 760 770 780 

CGACAGCATG ACAAATGAAG GAGAAATCCA CTGGCAGAAC CAGCTCACAT GGAAAAATGA 

790 800 810 820 830 840 

GTrATTTTTT AAAAACACGG ATGGTATTTT TTTGAATTAT TGGTGGAAAA AAGAATACCC 

850 860 870 880 890 900 

TGAAATGGCG CGTAGAGTAG CTCAAGGAAT AGGTAGATCA GGTITAGAAG TTTATTTTGG 

910 920 930 940 950 960 

TACAGATGTA TOGGGAAGGC ATACTTATGG TGGCGGTGGT TTCAAATCAT ATAAGGGTGT 

970 980 990 1000 1010 1020 

AAAAACTGCC "EACTCT3CAA TGACATCTTC TGCATTATTT GGTATGGCAT GGACATACGA 

1030 1040 1050 1060 1070 1080 

GCATTTCGAA AAGTCTGAAT TTGAAAAGAT GGATCGTTIX3 TTTTGGTGTG GTGGTAAATA 

1090 1100 1110 1120 1130 1140 

CTCTGACTRT CCfCCCCCAC CTCCTAAAAA CCCAGATOAC GAAAAAGAAG TAGAAAGCGA 

1150 1160 1170 1180 1190 1200 

TGATAGTCAA GATCAGCTCA TGTACGGACA CAAGAAAGGT ATIGCIGACA CGGTAGAATC 

1210 1220 1230 1240 1250 1260 

TATTCCTCTA CCAGGAACAG ATTGGTTTGT TACCAATTTT GATAGGGGGT TTGGAAATAG 

1270 1280 129 0 1300 1310 1320 

GTITTATTAT AGAGGAAAGA GATTACTTTC TCAGCCTTGG TCCCATITAT CGCATCAAGC 

1330 1340 1350 1360 1370 1380 

TATTCTCCCC AATAAAAGCT ATCGAAATCC AGAGATTTAT CCCACTGATC AAAACATTAA 



Entire nucleotide sequence of the fragment inserted into the Sal I-Not I sites of pZL-Endo 
including the full-length novel endo- -N-acetylglucosaminidase gene. 
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.4 



1390 


1400 


1410 


1420 


1430 


1440 




TCTCTCGATT 








CGCTTATTAT 


1450 


1460 


1470 


1480 


1490 


1500 


CAAAGGCCAA 


CGTTTCAATC 


ATAGAGAATC 


GCATGATGTT 


GAAACTGAAA 


TTAGTATACC 


1510 


1520 


1530 


1540 


1550 


1560 


TCTGTATAAG 


CTTICATTAG 


ATGCTAGTAA 


AGGATGCTCA 


TFGCGTTFATA 


TTTATAGAAC 


1570 


1580 


1590 


1600 


1610 


1620 


TITGTTGATG 


AAAGATGTAA 


AGTTGACAGT 


AGCATGTCAC 


TTTTCGTTAA 


AAACAAACGA 


1630 


1640 


1650 


1660 


1670 


1680 


CTCAGTEAAT 


TTCTTCAAGG 


TATGGCAGCC 


AGATGAAAAT 




AATATGATGA 


1690 


1700 


1710 


1720 


1730 


1740 


TQGAAT3AGA 


GCCACTGTEA 


CAACTGAAAA 


TTCTACCGAA 


AGCAGATGCT 


TT1UATTACG 


1750 


1760 


1770 


1780 


1790 


1800 


TACAACAGAA 


GAAGATACAG 


GAGAAAATGA 


TTGGATAACA 


AAAACTATTA 


ATGTGCCTGC 


1810 


1820 


1830 


1840 


1850 


1860 


TGTTCCAGAA 


GGAAGTCAAT 


TATACATTAC 


AAGACTTGAA 


GTGAGCGTAG 


TATTAGATAC 


1870 


1880 


1890 


1900 


1910 


1920 


AUL'lUGTriA 


GTAGGTCTTG 


1TAATCAAGT 


TATTGCTTGC 


TTGGGATATA 


TTAGCATCAT 


1930 


1940 


1950 


1960 


1970 


1980 


ACCAACTATA 


AATICTGGAA TAAAAACAGA 


TTCATCACGC 


ATTATTCAGG 


ATCrCTiTIG 


1990 


2000 


2010 


2020 


2030 


2040 


GAAAGATCAG 


AAATATACCA AAATCGGAAA 


AGAAAGTTTA 


GACGACATAG 


CTCAAGAAGA 


2050 


2060 


2070 


2080 


2090 


2100 


AGTTCATAGA 


TATTATGGAA 


CATTGAACTG 


GGAAAACACA 


GCAAATGTAG 


TAAACGCTTG 


2110 


2120 


2130 


2140 


2150 


2160 


GGAGGAAATA 


GATTACIACA 


ACGTTmTA 


CAAAGAAAGT 


GACGACTCTG 


CAACTCGCAT 


2170 


2180 


2190 


2200 


2210 


2220 


CTTTTTAGGA 


ACAGCATTCT GTAATCAATT 


TCGTGTATCT 


GGTTTAGATA 


TTA1TTTATC 


2230 


2240 


2250 


2260 


2270 


2280 


TAAGCTACCA 


AAGATAGTTA 


TTGAAGCTGT 


TAACAAAGAA 


GGATACATCT 


CTTCAAGTGG 


2290 


2300 


2310 


2320 


2330 


2340 


TAGCATAGAT 


TTGTCATTAA ACTAGGACTT 


GAAATAAAAT 


ATTATGATAA 


AGAAAAAAAA 


2350 


2360 


2370 


2380 


2390 


2400 


AAAAAAAAAA 


AAAAAAAAAG 


GGCGGCCGC. 









Entire nucleotide sequence of the fragment inserted into the Sal I-Not I sites of pZL-Endo 
including the full-length novel endo- /9 -N-acetylglucosaminidase gene. (Continued) 
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FIG 5 09/700993 

' ^ " 9 18 27 36 45 54 

5' ATG CCT TCA CTT CAA TTG CAA CCT GAT GAC AAA CTA GCA CCT GTT TCT ITT GCA 

MPSLQLQPDDKLAPVSFA 

63 72 81 90 99 10B 

CTT AAG TCT ATG AAT GAG TTG AGG GAC TGG ACG CCA GAC GAA AAG ATA AAG TXT 



M H E M 



Amino acid sequence deduced from the novel Endo- /9 -N-acetylglucosaminidase gene, and 
the nucleotide sequence of the DNA encoding this amino acid sequence. 
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711 720 729 738 747 756 

AAA AAT GAG TTA TTT TTT AAA AAC ACG GAT GOT ATT TTT TTG AAT TAT TGG TOG 

KNELFFKNTDGIFLNYWW 

765 774 783 792 801 810 

AAA AAA GAA TAC OCT GAA ATG GCG CGT AGA GTA GOT GAA GGA ATA GGT AGA TCA 

KKEYPEMARRVAEGIGRS 

819 828 837 846 855 864 

GGT TTA GAA GTT TAT, TTT GGT ACA GAT GTA TGG GGA AGG CAT ACT TAT GGT GGC 

GLEVYFGTDVWGRHTYGG 

873 882 891 900 909 918 

GGT GGT TTC AAA TCA TAT AAG GGT GTA AAA ACT GCC TAC TCT GCA ATG ACA TCT 

GGFKSYK GVKTAYSAMTS 

927 936 945 954 963 972 

TCT GCA TTA TTT GGT ATG GCA TGG ACA TAC GAG CAT TTC GAA AAG TCT GAA TTT 

SALFGMAWTYEHFEKSEF 

981 990 999 1008 1017 1026 

GAA AAG ATG GAT CGT TTG TTT TGG TGT GGT GGT AAA TAC TCT GAC TAT CCT CCC 

EKMDRLFWCGGKYSDYPP 

1035 1044 1053 1062 1071 1080 

CCA CCT CCT AAA AAC CCA GAT GAC GAA AAA GAA GTA GAA AGC GAT GAT AGT GAA 

PPPKHPDDEKEVESDDSE 

1089 1098 1107 1116 1125 1134 

GAT GAG CTC ATG TAC GGA CAC AAG AAA GGT ATT GCT GAC ACG GTA GAA TCT ATT 

DELMYGHKKGIADTVESI 

1143 1152 1161 1170 1179 1188 

CCT GTA CCA GGA ACA GAT TGG TTT GTT ACC AAT TTT GAT AGG GGG TTT GGA AAT 

PVPGTDWFVTNFDRGFGN 

1197 1206 1215 1224 1233 1242 

AGG TTT TAT TAT AGA GGA AAG AGA TTA CTT TCT CAG CCT TGG TCC CAT TTA TCG 

RFYYRGKRLltSQPWSHIiS 

1251 1260 1269 1278 1287 1296 

CAT CAA GCT ATT CTC CCC AAT AAA AGC TAT CGA AAT CCA GAG ATT TAT CCC ACT 

HQAILPHKSY. RNPEIYPT 

1305 1314 1323 1332 1341 1350 

GAT CAA AAC ATT AAA ATC ACT AGT TCT CTC GAT TGC GAT CAT GGA GCT TTT CTT 

DQNIKITSSLDCDHGAFL 

1359 1368 1377 1386 1395 1404 

GGT GGA ACC TCG CTT ATT ATC AAA GGC CAA CGT TTC AAT CAT AGA GAA TCG CAT 

GGTSLIIKGQRFNHRESH 

1413 1422 1431 1440 1449 1458 

GAT GTT GAA ACT GAA ATT AGT ATA CCT CTG TAT AAG CTT TCA TTA GAT GCT AGT 

DVETEISIPLYKLSLDAS 

Amino acid sequence deduced from the novel Endo- /9 -N-acetylglucosaminidase gene, and 
the nucleotide sequence of the DNA encoding this amino acid sequence. (Continued) 
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1467 



1476 



185 



1494 



1503 



1512 



AAA GGA TGC TCA TTG CGT TAT ATT TAT AGA ACT TIG TTG ATG AAA GAT GTA AAG 



KGCSL'RYI YRTLLMKDVK 

1521 1530 1539 1548 1557 1566 

TTG ACA GTA OCA TGT CAC TIT TCG TTA AAA ACA AAC GAC TCA GTT AAT TIC TIC 

LTVACHFSLKTNDSVNFF 

1575 1584 1593 1602 1611 1620 

AAG GTA TOG CAG CCA GAT GAA AAT TIC TCT TIT GAA TAT GAT GAT QGA ATG AGA 

KVWQPDENFSFEYDDGMR 

1629 1638 1647 1656 1665 1674 

GCC ACT GTT ACA ACT GAA AAT TCT AOC GAA AGC AGA TGC TTT OTA TTA CGT ACA 

ATVTTEMSTESRCFLliRT 

1583 1692 1701 1710 1719 1728 

ACA GAA GAA GAT ACA GGA GAA AAT GAT TGG ATA ACA AAA ACT ATT AAT GTS CCT 

TEEDTGENDWITKT INVP 

1737 1746 1755 1764 1773 ' 1782 

GCT GTT CCA GAA GGA AGT CAA TTA TAC ATT ACA AGA CTT GAA GTG AGC GTA GTA 

AVPEGSQLYITRI.EVSVV 

1791 1300 1809 1818 1827 1836 

TTA GAT ACA GCT GOT TTA GTA GGT CTT GTT AAT CAA GTT ATT GCT TGC TTG QGA 

LDTAGLVGLVNQVIACLG 

1845 1854 1863 1872 1881 1890 

TAT ATT AGC AOC ATA CCA ACT ATA AAT TCT QGA ATA AAA ACA GAT TCA TCA CSC 

YISIIPTINSGIKTDSSR 

1899 1908 1917 1926 1935 1944 

ATT ATT CAG GAT CTC TTT TOG AAA GAT CAG AAA TAT ACC AAA ATC QGA AAA GAA 

IIQDLFWKDQKYTKIGKB 

1953 1962 1971 1980 1989 1998 

AGT TTA GAC GAC ATA GCT CAA GAA GAA GTT CAT AGA TAT TAT GGA ACA TTG AAC 

SI.DDI AQEEVHRYYGTLN 

2007 2016 2025 2034 2043 2052 

TGG GAA AAC ACA GCA AAT GTA GTA AAC GCT TGG GAG GAA ATA GAT TAC TAC AAC 

WEHTANVVN AWEEIDYYN 

2061 2070 2079 2088 2097 2106 

GTT TIT TAC AAA GAA AGT GAC GAC TCT GCA ACT CGC ATC TTT TTA GGA ACA GCA 

VFYKESDDSATRIFI.GTA 

2115 2124 2133 2142 2151 2160 

TTC TGT AAT CAA TTT CGT GTA TCT GGT TTA GAT ATT ATT TTA TCT AAG CTA CCA 

FCSQFRVSGLDI ILSKLP 

2169 2178 2187 2196 2205 2214 

AAG ATA GTT ATT GAA GCT GTT AAC AAA GAA GGA TAC ATC TCT TCA AGT GGT AGC 

KIVIEAVHKEGYISSSGS 

2223 2232 
ATA GAT TTG TCA TTA AAC TAG 3' 

I D L S L N * 



Amino acid sequence deduced from the novel Endo- 0 -N-acetylglucosaminidase gene, and 
the nucleotide sequence of the DNA encoding this amino acid sequence. (Continued) 
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09/700993 

FIG. 8 



Novel endo- j3 -N-acetylglucosaminidase gene 




2.5 kbp 



Structure of expression vector pGEndo-SC for the use in Saccharomyces cerevisiae, which 
comprises a novel endo- jS -N-acetylglucosaminidase gene. 



FIG. 9 



09/700993 




1 2 3 4 5 



Expression of endo-0 -N-acetylglucosannnidase enzyme in yeast into which 
an endo- & -N-acetylglucosaminidase gene has been introduced. 

Lanes 1-3: Cellular extract of S. cerevisiae YPH500 (pep4) into which 

an endo- 0 -N-acetylglucosaminidase gene has been introduced. 
Lane 4: Purified endo- 0 -N-acetylglucosaminidase derived from M. hiemalis 
Lane 5: Cellular extract of 5. cerevisiae YPH500 (pep4) 
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DECLARATION AND POWER OF ATTORNEY 

As a below named inventor, I HEREBY DECLARE: 

THAT my residence, post office address, and citizenship are as stated below next to my 

name; 

THAT I believe I am the original, first, and sole inventor {if only one inventor is named 
below) or an original, first, and joint inventor (if plural inventors are named below or in an 
attached Declaration) of the subject matter which is claimed and for which a patent is sought on 
the invention entitled 

ENDO-p-N-ACETYLGLUCOSAMINIDASE GENE 



(Attorney Docket No. 081356/0154) 

the specification of which (check one) 

is attached hereto. 

X was filed on May 20, 1999 as United States Application Number or 
PCT International Application Number PCT/JP99/02644 and was 
■j amended on (if applicable). 

THAT I do not know and do not believe that the same invention was ever known or used 
by others in the United States of America, or was patented or described in any printed 
publication in any country, before I (we) invented it; 

THAT 1 do not know and do not believe that the same invention was patented or 
described in any printed publication in any country, or in public use or on sale in the United 
States of America, for more than one year prior to the filing date of this United States 
application; 

THAT I do not know and do not believe that the same invention was first patented or 
made the subject of an inventor's certificate that issued in any country foreign to the United 
States of America before the filing date of this United States application if the foreign application 
was filed by me (us), or by my (our) legal representatives or assigns, more than twelve months 
(six months for design patents) prior to the filing date of this United States application; 

THAT I have reviewed and understand the contents of the above-identified specification, 
including the claim(s), as amended by any amendment specifically referred to above; 

THAT I believe that the above-identified specification contains a written description of 
the invention, and of the manner and process of making and using it, in such full, clear, concise, 
and exact terms as to enable any person skilled in the art to which it pertains, or with which it is 
most nearly connected, to make and use the invention, and sets forth the best mode 
contemplated by me of carrying out the invention; and 

THAT I acknowledge the duty to disclose to the U.S. Patent and Trademark Office all 
information known to me to be material to patentability as defined in Title 37, Code of Federal 
Regulations, §1 .56. 
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I HEREBY CLAIM foreign priority benefits under Title 35, United States Code §119(a)-(d) 
or § 365{b) of any foreign application(s) for patent or inventor's certificate, or §365(a) of any 
PCT international application which designated at least one country other than the United States 
of America, listed below and have also identified below any foreign application for patent or 
inventor's certificate or of any PCT international application having a filing date before that of the 
application on which priority is claimed. 



Prior Foreign 
Application Number 


Country 


Foreign Filing Date 


Priority 
Claimed? 


Certified 

Copy 
Attached? 


10-141717 


Japan 


May 22, 1998 


Yes 

























I HEREBY CLAIM the benefit under Title 35, United States Code § 1 19(e) of any United 
States provisional application(s) listed below. 



U.S. Provisional Application Number 


Filing Date 















I HEREBY CLAIM the benefit under Title 35, United States Code, §120 of any United 
States application(s), or § 365(c) of any PCT international application designating the United 
States of America, listed below and, insofar as the subject matter of each of the claims of this 
application is not disclosed in the prior United States or PCT International application in the 
manner provided by the first paragraph of Title 35, United States Code, § 1 1 2, I acknowledge 
the duty to disclose information which is material to patentability as defined in Title 37, Code of 
Federal Regulations, § 1 .56 which became available between the filing date of the prior 
application and the national or PCT international filing date of this application. 



U.S. Parent 
Application Number 


PCT Parent 
Application Number 


Parent 
Filing Date 


Parent 
Patent Number 



























I HEREBY APPOINT the following registered attorneys and agents of the law firm of 
FOLEY & LARDNER: 



STEPHEN A. BENT 


Reg. 


No. 


29,768 


DAVID A. BLUMENTHAL 


Reg. 


No. 


26T257~- 


BETH A. BURROUS 


Reg. 


No. 


35,087 


ALAN I. CANTOR 


Reg. 


No. 


musr 


WILLIAM T. ELLIS 


Reg. 


No. 




JOHN J. FELDHAUS 


Reg. 


No. 




PATRICIA D. GRAN ADOS 


Reg. 


No. 


33.683 
33,715 " 


JOHN P. ISACSON 


Reg. 


No. 
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MICHAEL D. KAMINSKI 


Reg. 


No. 


32,904 


LYLE K. KIMMS 


Reg. 


No. 


34,079 


KENNETH E. KROSIN 


Reg. 


No. 


25,735 


JOHNNY A. KUMAR 


Reg. 


No. 


34,649 


GLENN LAW 


Reg. 


No. 


34.37JT 


PETER G. MACK 


Reg. 


No. 




BRIAN J. MC NAMARA 


Reg. 


No. 


32,789 


SYBIL MELOY 


Reg. 


No. 


22,749 


RICHARD C. PEET 


Reg. 


No. 


3T779T- 


GEORGE E. QUILLIN 


Reg. 


No. 




BERNHARD D. SAXE 


Reg. 


No. 




CHARLES F. SCHILL 


Reg. 


No. 


27.590 


RICHARD L. SCHWAAB 


Reg. 


No. 


25.479 


HAROLD C. WEGNER 


Reg. 


No. 


25,258__ 



to have full power to prosecute this application and any continuations, divisions, reissues, and 
reexaminations thereof, to receive the patent, and to transact all business in the United States 
Patent and Trademark Office connected therewith. 



I request that all correspondence be directed to: 



Stephen A. Bent 

FOLEY & LARDNER 

Washington Harbour 

3000 K Street, N.W., Suite 500 

Washington, D.C. 20007-5109 

Telephone: (202) 672-5404 
Facsimile: (202) 672-5399 



I UNDERSTAND AND AGREE THAT the foregoing attorneys and agents appointed by me 
to prosecute this application do not personally represent me or my legal interests, but instead 
represent the interests of the legal owner(s) of the invention described in this application. 

I FURTHER DECLARE THAT all statements made herein of my own knowledge are true, 
and that all statements made on information and belief are believed to be true; and further that' 
these statements were made with the knowledge that willful false statements and the like so 
made are punishable by fine or imprisonment, or both, under Section 1001 of Title 18 of the 
United States Code, and that such willful false statements may jeopardize the validity of the 
application or any patent issuing thereon. 



Name of first inventor 

Residence 

Citizenship 

Post Office Address 

Inventor's signature 
Date 



Kazuo K obayashi 



Y^laahiiiaa^hi, JAPAN Q 



c/o Kiban Gijutsu Kenkyusho, Kirin Beer Kabushiki Kaisha, 1-13-5, 
Fukuura, Kanazawa-ku, Yokohama-shi, Kanagawa 236-0004 Japan 





July 6, 2001 
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Date 
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Name of second inventor's legal representative 



Ypriko TAKEUCHI 
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Post Office Address 



Yokohama-shi, JAPAN 



Japanese 
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Fukuura, Kanazawa-ku, Yokohama-shi, Kanagawa 236-0004 Japan 



Legal representative's signature 
Date 
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Residence 

Citizenship 

Post Office Address 

Inventor's signature 
Date 

Name of fourth inventor 

Residence 

Citizenship 

Post Office Address 

Inventor's signature 

Date 
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Akihiko Iwamatsu 



Yokohama-shi, JAPAN -J 
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Japanese 
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Hidehik o Kumagai 
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32-2, Hiyoshidai 3-chome, 
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SEQUENCE LISTING 

<110> KIRIN BEER KABUSHIKI KAISHA 

<120> Endo- p -N-Acetylglucosaminidase Gene 

<130> PH-657-PCT 

<140> 
<141> 

<150> JP98/141717 
<151> 1998-05-22 

<160> 37 

<170> Patentln Ver. 2.0 

<210> 1 
<211> 2369 
<212> DNA 

<213> Mucor hiemalis 
<400> 1 

gtcgacccac gcgtccgcgg acgcgtgggc ggacgcgtgg gcggacgcgt gggttttatt 60 
ttacataaat atgccttcac ttcaattgca acctgatgac aaactagcac ctgtttcttt 120 
tgcacttaag tctatgaatg agttgaggga ctggacgcca gacgaaaaga taaagtttaa 180 
cgtttcaagc gtggcactac agcctcgtgt gaaaaacgcc ctgaaacctc aattattgtt 240 
aactcatgat atggcaggag gatataaaga agataaaaat attcaaggaa acaattataa 300 
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agacatttat aacattcaat attggcattt agctgatact tttgtatatt tctctcatga 360 
gcgagttagc attcctccag tcaattggac aaatgcttgt catagaaatg gtgtaaagtg 420 
tttaggtact tttttagtag aaggaaataa ccaaatgcat gaaatggaag ccttgcttca 480 
cggtccacct ttacttaata acactgacga ccctatgaga ttatggagtc cgtattatgc 540 
agaccaatta gttgctattg ctaaacacta tggttttgat ggctggttgt tcaatattga 600 
atgcgaattc tttccttttc ctacaaatcc aaaattcaaa gctgaagagt tggcaaagtt 660 
tctacactat tttaaggaaa aattgcataa cgaaatacct ggatctcaac tcatttggta 720 
cgacagcatg acaaatgaag gagaaatcca ctggcagaac cagctcacat ggaaaaatga 780 
gttatttttt aaaaacacgg atggtatttt tttgaattat tggtggaaaa aagaataccc 840 
tgaaatggcg cgtagagtag ctgaaggaat aggtagatca ggtttagaag tttattttgg 900 
tacagatgta tggggaaggc atacttatgg tggcggtggt ttcaaatcat ataagggtgt 960 
aaaaactgcc tactctgcaa tgacatcttc tgcattattt ggtatggcat ggacatacga 1020 
gcatttcgaa aagtctgaat ttgaaaagat ggatcgtttg ttttggtgtg gtggtaaata 1080 
ctctgactat cctcccccac ctcctaaaaa cccagatgac gaaaaagaag tagaaagcga 1140 
tgatagtgaa gatgagctca tgtacggaca caagaaaggt attgctgaca cggtagaatc 1200 
tattcctgta ccaggaacag attggtttgt taccaatttt gatagggggt ttggaaatag 1260 
gttttattat agaggaaaga gattactttc tcagccttgg tcccatttat cgcatcaagc 1320 
tattctcccc aataaaagct atcgaaatcc agagatttat cccactgatc aaaacattaa 1380 
aatcactagt tctctcgatt gcgatcatgg agcttttctt ggtggaacct cgcttattat 1440 
caaaggccaa cgtttcaatc atagagaatc gcatgatgtt gaaactgaaa ttagtatacc 1500 
tctgtataag ctttcattag atgctagtaa aggatgctca ttgcgttata tttatagaac 1560 
tttgttgatg aaagatgtaa agttgacagt agcatgtcac ttttcgttaa aaacaaacga 1620 
ctcagttaat ttcttcaagg tatggcagcc agatgaaaat ttctcttttg aatatgatga 1680 
tggaatgaga gccactgtta caactgaaaa ttctaccgaa agcagatgct ttttattacg 1740 
tacaacagaa gaagatacag gagaaaatga ttggataaca aaaactatta atgtgcctgc 1800 
tgttccagaa ggaagtcaat tatacattac aagacttgaa gtgagcgtag tattagatac 1860 
agctggttta gtaggtcttg ttaatcaagt tattgcttgc ttgggatata ttagcatcat 1920 
accaactata aattctggaa taaaaacaga ttcatcacgc attattcagg atctcttttg 1980 
gaaagatcag aaatatacca aaatcggaaa agaaagttta gacgacatag ctcaagaaga 2040 
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agttcataga tattatggaa cattgaactg 
ggaggaaata gattactaca acgtttttta 
ctttttagga acagcattct gtaatcaatt 
taagctacca aagatagtta ttgaagctgt 
tagcatagat ttgtcattaa actaggactt 
aaaaaaaaaa aaaaaaaaag ggcggccgc 



ggaaaacaca gcaaatgtag taaacgcttg 2100 
caaagaaagt gacgactctg caactcgcat 2160 
tcgtgtatct ggtttagata ttattttatc 2220 
taacaaagaa ggatacatct cttcaagtgg 2280 
gaaataaaat attatgataa agaaaaaaaa 2340 

2369 



<210> 2 
<211> 2235 
<212> DNA 

<213> Mucor hiemalis 

<220> 

<221> CDS 

<222> (1). . (2235) 

<400> 2 

atg cct tea ctt caa ttg caa cct gat gac aaa eta gca cct gtt tct 48 
Met Pro Ser Leu Gin Leu Gin Pro Asp Asp Lys Leu Ala Pro Val Ser 

15 10 15 

ttt gca ctt aag tct atg aat gag ttg agg gac tgg acg cca gac gaa 96 
Phe Ala Leu Lys Ser Met Asn Glu Leu Arg Asp Trp Thr Pro Asp Glu 

20 25 30 

aag ata aag ttt aac gtt tea age gtg gca eta cag cct cgt gtg aaa 144 
Lys He Lys Phe Asn Val Ser Ser Val Ala Leu Gin Pro Arg Val Lys 

35 40 45 

aac gee ctg aaa cct caa tta ttg tta act cat gat atg gca gga gga 192 
Asn Ala Leu Lys Pro Gin Leu Leu Leu Thr His Asp Met Ala Gly Gly 
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50 55 60 

tat aaa gaa gat aaa aat att caa gga aac aat tat aaa gac att tat 240 
Tyr Lys Glu Asp Lys Asn He Gin Gly Asn Asn Tyr Lys Asp He Tyr 
65 70 75 80 

aac att caa tat tgg cat tta get gat act ttt gta tat ttc tct cat 288 
Asn He Gin Tyr Trp His Leu Ala Asp Thr Phe Val Tyr Phe Ser His 

85 90 95 

gag cga gtt age att cct cca gtc aat tgg aca aat get tgt cat aga 336 
Glu Arg Val Ser He Pro Pro Val Asn Trp Thr Asn Ala Cys His Arg 

100 105 110 

aat ggt gta aag tgt tta ggt act ttt tta gta gaa gga aat aac caa 384 
Asn Gly Val Lys Cys Leu Gly Thr Phe Leu Val Glu Gly Asn Asn Gin 

115 120 125 

atg cat gaa atg gaa gec ttg ctt cac ggt cca cct tta ctt aat aac 432 
Met His Glu Met Glu Ala Leu Leu His Gly Pro Pro Leu Leu Asn Asn 

130 135 140 

act gac gac cct atg aga tta tgg agt ccg tat tat gca gac caa tta 480 
Thr Asp Asp Pro Met Arg Leu Trp Ser Pro Tyr Tyr Ala Asp Gin Leu 
145 150 155 160 

gtt get att get aaa cac tat ggt ttt gat ggc tgg ttg ttc aat att 528 
Val Ala He Ala Lys His Tyr Gly Phe Asp Gly Trp Leu Phe Asn He 

165 170 175 

gaa tgc gaa ttc ttt cct ttt cct aca aat cca aaa ttc aaa get gaa 576 
Glu Cys Glu Phe Phe Pro Phe Pro Thr Asn Pro Lys Phe Lys Ala Glu 

180 185 190 

gag ttg gca aag ttt eta cac tat ttt aag gaa aaa ttg cat aac gaa 624 
Glu Leu Ala Lys Phe Leu His Tyr Phe Lys Glu Lys Leu His Asn Glu 

195 200 205 

ata cct gga tct caa etc att tgg tac gac age atg aca aat gaa gga 672 
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He Pro Gly Ser Gin Leu He Trp Tyr Asp Ser Met Thr Asn Glu Gly 

210 215 220 

gaa ate cac tgg cag aac cag etc aca tgg aaa aat gag tta ttt ttt 720 
Glu He His Trp Gin Asn Gin Leu Thr Trp Lys Asn Glu Leu Phe Phe 
225 230 235 240 

aaa aac acg gat ggt att ttt ttg aat tat tgg tgg aaa aaa gaa tac 768 
Lys Asn Thr Asp Gly He Phe Leu Asn Tyr Trp Trp Lys Lys Glu Tyr 

245 250 255 

cct gaa atg gcg cgt aga gta get gaa gga ata ggt aga tea ggt tta 816 
Pro Glu Met Ala Arg Arg Val Ala Glu Gly He Gly Arg Ser Gly Leu 

260 265 270 

gaa gtt tat ttt ggt aca gat gta tgg gga agg cat act tat ggt ggc 864 
Glu Val Tyr Phe Gly Thr Asp Val Trp Gly Arg His Thr Tyr Gly Gly 

275 280 285 

ggt ggt ttc aaa tea tat aag ggt gta aaa act gee tac tct gca atg 912 
Gly Gly Phe Lys Ser Tyr Lys Gly Val Lys Thr Ala Tyr Ser Ala Met 

290 295 300 

aca tct tct gca tta ttt ggt atg gca tgg aca tac gag cat ttc gaa 960 
Thr Ser Ser Ala Leu Phe Gly Met Ala Trp Thr Tyr Glu His Phe Glu 
305 310 315 320 

aag tct gaa ttt gaa aag atg gat cgt ttg ttt tgg tgt ggt ggt aaa 1008 
Lys Ser Glu Phe Glu Lys Met Asp Arg Leu Phe Trp Cys Gly Gly Lys 

325 330 335 

tac tct gac tat cct ccc cca cct cct aaa aac cca gat gac gaa aaa 1056 
Tyr Ser Asp Tyr Pro Pro Pro Pro Pro Lys Asn Pro Asp Asp Glu Lys 

340 345 350 

gaa gta gaa age gat gat agt gaa gat gag etc atg tac gga cac aag 1104 
Glu Val Glu Ser Asp Asp Ser Glu Asp Glu Leu Met Tyr Gly His Lys 
355 360 365 
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aaa ggt att get gac acg gta gaa tct att cct gta cca gga aca gat 1152 
Lys Gly lie Ala Asp Thr Val Glu Ser He Pro Val Pro Gly Thr Asp 

370 375 380 

tgg ttt gtt acc aat ttt gat agg ggg ttt gga aat agg ttt tat tat 1200 
Trp Phe Val Thr Asn Phe Asp Arg Gly Phe Gly Asn Arg Phe Tyr Tyr 
385 390 395 400 

aga gga aag aga tta ctt tct cag cct tgg tec cat tta teg cat caa 1248 
Arg Gly Lys Arg Leu Leu Ser Gin Pro Trp Ser His Leu Ser His Gin 

405 410 415 

get att etc ccc aat aaa age tat cga aat cca gag att tat ccc act 1296 
Ala He Leu Pro Asn Lys Ser Tyr Arg Asn Pro Glu He Tyr Pro Thr 

420 425 430 

gat caa aac att aaa ate act agt tct etc gat tgc gat cat gga get 1344 
Asp Gin Asn He Lys He Thr Ser Ser Leu Asp Cys Asp His Gly Ala 

435 440 445 

ttt ctt ggt gga acc teg ctt att ate aaa ggc caa cgt ttc aat cat 1392 
Phe Leu Gly Gly Thr Ser Leu He He Lys Gly Gin Arg Phe Asn His 

450 455 460 

aga gaa teg cat gat gtt gaa act gaa att agt at a cct ctg tat aag 1440 
Arg Glu Ser His Asp Val Glu Thr Glu He Ser He Pro Leu Tyr Lys 
465 470 475 480 

ctt tea tta gat get agt aaa gga tgc tea ttg cgt tat att tat aga 1488 
Leu Ser Leu Asp Ala Ser Lys Gly Cys Ser Leu Arg Tyr He Tyr Arg 

485 490 495 

act ttg ttg atg aaa gat gta aag ttg aca gta gca tgt cac ttt teg 1536 
Thr Leu Leu Met Lys Asp Val Lys Leu Thr Val Ala Cys His Phe Ser 

500 505 510 

tta aaa aca aac gac tea gtt aat ttc ttc aag gta tgg cag cca gat 1584 
Leu Lys Thr Asn Asp Ser Val Asn Phe Phe Lys Val Trp Gin Pro Asp 
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515 520 525 

gaa aat ttc tct ttt gaa tat gat gat gga atg aga gcc act gtt aca 1632 
Glu Asn Phe Ser Phe Glu Tyr Asp Asp Gly Met Arg Ala Thr Val Thr 

530 535 540 

act gaa aat tct acc gaa age aga tgc ttt tta tta cgt aca aca gaa 1680 
Thr Glu Asn Ser Thr Glu Ser Arg Cys Phe Leu Leu Arg Thr Thr Glu 
545 550 555 560 

gaa gat aca gga gaa aat gat tgg ata aca aaa act att aat gtg cct 1728 
Glu Asp Thr Gly Glu Asn Asp Trp He Thr Lys Thr He Asn Val Pro 

565 570 575 

get gtt cca gaa gga agt caa tta tac att aca aga ctt gaa gtg age 1776 
Ala Val Pro Glu Gly Ser Gin Leu Tyr lie Thr Arg Leu Glu Val Ser 

580 585 590 

gta gta tta gat aca get ggt tta gta ggt ctt gtt aat caa gtt att 1824 
Val Val Leu Asp Thr Ala Gly Leu Val Gly Leu Val Asn Gin Val He 

595 600 605 

get tgc ttg gga tat att age ate ata cca act ata aat tct gga ata 1872 
Ala Cys Leu Gly Tyr He Ser lie He Pro Thr He Asn Ser Gly He 

610 615 620 

aaa aca gat tea tea cgc att att cag gat etc ttt tgg aaa gat cag 1920 
Lys Thr Asp Ser Ser Arg He He Gin Asp Leu Phe Trp Lys Asp Gin 
625 630 635 640 

aaa tat acc aaa ate gga aaa gaa agt tta gac gac ata get caa gaa 1968 
Lys Tyr Thr Lys He Gly Lys Glu Ser Leu Asp Asp He Ala Gin Glu 

645 650 655 

gaa gtt cat aga tat tat gga aca ttg aac tgg gaa aac aca gca aat 2016 
Glu Val His Arg Tyr Tyr Gly Thr Leu Asn Trp Glu Asn Thr Ala Asn 

660 665 670 

gta gta aac get tgg gag gaa ata gat tac tac aac gtt ttt tac aaa 2064 
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Val Val Asn Ala Trp Glu Glu lie Asp Tyr Tyr Asn Val Phe Tyr Lys 

675 680 685 

gaa agt gac gac tct gca act cgc ate ttt tta gga aca gca ttc tgt 2112 
Glu Ser Asp Asp Ser Ala Thr Arg He Phe Leu Gly Thr Ala Phe Cys 

690 695 700 

aat caa ttt cgt gta tct ggt tta gat att att tta tct aag eta cca 2160 
Asn Gin Phe Arg Val Ser Gly Leu Asp He He Leu Ser Lys Leu Pro 
705 710 715 720 

aag ata gtt att gaa get gtt aac aaa gaa gga tac ate tct tea agt 2208 
Lys He Val He Glu Ala Val Asn Lys Glu Gly Tyr He Ser Ser Ser 

725 730 735 

ggt age ata gat ttg tea tta aac tag 2235 
Gly Ser He Asp Leu Ser Leu Asn 

740 745 



<210> 3 
<211> 744 
<212> PRT 

<213> Mucor hiemalis 
<400> 3 

Met Pro Ser Leu Gin Leu Gin Pro Asp Asp Lys Leu Ala Pro Val Ser 

15 10 15 

Phe Ala Leu Lys Ser Met Asn Glu Leu Arg Asp Trp Thr Pro Asp Glu 

20 25 30 

Lys He Lys Phe Asn Val Ser Ser Val Ala Leu Gin Pro Arg Val Lys 

35 40 45 

Asn Ala Leu Lys Pro Gin Leu Leu Leu Thr His Asp Met Ala Gly Gly 
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50 55 60 

Tyr Lys Glu Asp Lys Asn He Gin Gly Asn Asn Tyr Lys Asp He Tyr 
65 70 75 80 

Asn He Gin Tyr Trp His Leu Ala Asp Thr Phe Val Tyr Phe Ser His 

85 90 95 

Glu Arg Val Ser lie Pro Pro Val Asn Trp Thr Asn Ala Cys His Arg 

100 105 110 

Asn Gly Val Lys Cys Leu Gly Thr Phe Leu Val Glu Gly Asn Asn Gin 

115 120 125 

Met His Glu Met Glu Ala Leu Leu His Gly Pro Pro Leu Leu Asn Asn 

130 135 140 

Thr Asp Asp Pro Met Arg Leu Trp Ser Pro Tyr Tyr Ala Asp Gin Leu 
145 150 155 160 

Val Ala He Ala Lys His Tyr Gly Phe Asp Gly Trp Leu Phe Asn He 

165 170 175 

Glu Cys Glu Phe Phe Pro Phe Pro Thr Asn Pro Lys Phe Lys Ala Glu 

180 185 190 

Glu Leu Ala Lys Phe Leu His Tyr Phe Lys Glu Lys Leu His Asn Glu 

195 200 205 

He Pro Gly Ser Gin Leu He Trp Tyr Asp Ser Met Thr Asn Glu Gly 

210 215 220 

Glu He His Trp Gin Asn Gin Leu Thr Trp Lys Asn Glu Leu Phe Phe 
225 230 235 240 

Lys Asn Thr Asp Gly He Phe Leu Asn Tyr Trp Trp Lys Lys Glu Tyr 

245 250 255 

Pro Glu Met Ala Arg Arg Val Ala Glu Gly lie Gly Arg Ser Gly Leu 

260 265 270 

Glu Val Tyr Phe Gly Thr Asp Val Trp Gly Arg His Thr Tyr Gly Gly 
275 280 285 
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Gly Gly Phe Lys Ser Tyr Lys Gly Val Lys Thr Ala Tyr Ser Ala Met 

290 295 300 

Thr Ser Ser Ala Leu Phe Gly Met Ala Trp Thr Tyr Glu His Phe Glu 
305 310 315 320 

Lys Ser Glu Phe Glu Lys Met Asp Arg Leu Phe Trp Cys Gly Gly Lys 

325 330 335 

Tyr Ser Asp Tyr Pro Pro Pro Pro Pro Lys Asn Pro Asp Asp Glu Lys 

340 345 350 

Glu Val Glu Ser Asp Asp Ser Glu Asp Glu Leu Met Tyr Gly His Lys 

355 360 365 

Lys Gly He Ala Asp Thr Val Glu Ser He Pro Val Pro Gly Thr Asp 

370 375 380 

Trp Phe Val Thr Asn Phe Asp Arg Gly Phe Gly Asn Arg Phe Tyr Tyr 
385 390 395 400 

Arg Gly Lys Arg Leu Leu Ser Gin Pro Trp Ser His Leu Ser His Gin 

405 410 415 

Ala He Leu Pro Asn Lys Ser Tyr Arg Asn Pro Glu lie Tyr Pro Thr 

420 425 430 

Asp Gin Asn He Lys He Thr Ser Ser Leu Asp Cys Asp His Gly Ala 

435 440 445 

Phe Leu Gly Gly Thr Ser Leu He He Lys Gly Gin Arg Phe Asn His 

450 455 460 

Arg Glu Ser His Asp Val Glu Thr Glu He Ser He Pro Leu Tyr Lys 
465 470 475 480 

Leu Ser Leu Asp Ala Ser Lys Gly Cys Ser Leu Arg Tyr He Tyr Arg 

485 490 495 

Thr Leu Leu Met Lys Asp Val Lys Leu Thr Val Ala Cys His Phe Ser 

500 505 510 

Leu Lys Thr Asn Asp Ser Val Asn Phe Phe Lys Val Trp Gin Pro Asp 
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515 520 525 

Glu Asn Phe Ser Phe Glu Tyr Asp Asp Gly Met Arg Ala Thr Val Thr 

530 535 540 

Thr Glu Asn Ser Thr Glu Ser Arg Cys Phe Leu Leu Arg Thr Thr Glu 
545 550 555 560 

Glu Asp Thr Gly Glu Asn Asp Trp He Thr Lys Thr He Asn Val Pro 

565 570 575 

Ala Val Pro Glu Gly Ser Gin Leu Tyr He Thr Arg Leu Glu Val Ser 

580 585 590 

Val Val Leu Asp Thr Ala Gly Leu Val Gly Leu Val Asn Gin Val He 

595 600 605 

Ala Cys Leu Gly Tyr He Ser He He Pro Thr He Asn Ser Gly He 

610 615 620 

Lys Thr Asp Ser Ser Arg He He Gin Asp Leu Phe Trp Lys Asp Gin 
625 630 635 640 

Lys Tyr Thr Lys He Gly Lys Glu Ser Leu Asp Asp He Ala Gin Glu 

645 650 655 

Glu Val His Arg Tyr Tyr Gly Thr Leu Asn Trp Glu Asn Thr Ala Asn 

660 665 670 

Val Val Asn Ala Trp Glu Glu He Asp Tyr Tyr Asn Val Phe Tyr Lys 

675 680 685 

Glu Ser Asp Asp Ser Ala Thr Arg He Phe Leu Gly Thr Ala Phe Cys 

690 695 700 

Asn Gin Phe Arg Val Ser Gly Leu Asp He He Leu Ser Lys Leu Pro 
705 710 715 720 

Lys He Val He Glu Ala Val Asn Lys Glu Gly Tyr He Ser Ser Ser 

725 730 735 

Gly Ser He Asp Leu Ser Leu Asn 
740 
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<210> 4 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> partial amino acid sequence of endo-p-N-acetylglucosamini 
<400> 4 

Pro Ser Leu Gin Leu Gin Pro Asp Asp Lys 
1 5 10 

<210> 5 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide designed based on the partial amino acid 
sequence of endo-p-N-acetylglucosaminidase 

<220> 

<221> modif ied_b ase 
<222> 12 

<223> n represents a, g, c or t 
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<400> 5 

carttrcarc cngaygayaa 



20 



<210> 6 
<211> 15 
<212> PRT 

<213> Artificial Sequence 



<220> 

<223> partial amino acid sequence of endo-p-N-acetylglucosaminidase 
<400> 6 

Ser Tyr Arg Asn Pro Glu He Tyr Pro Thr Asp Gin Asn He Lys 
15 10 15 



<210> 7 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide designed based on the partial amino acid 
sequence of endo-p-N-acetylglucosaminidase 

<220> 

<221> mod if ied_base 
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<222> 6 

<223> n represents a, g, c or t 
<400> 7 

cchacngayc araayatyaa 

<210> 8 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide designed based on the partial amino acid 
sequence of endo-p-N-acetylglucosaminidase 

<220> 

<221> mod if ied_base 
<222> 15 

<223> n represents a, g, c or t 
<400> 8 

ttratrttyt grtcngtdgg 



<210> 9 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
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<220> 

<223> partial amino acid sequence of endo-p-N-acetylglucosaminidase 
<400> 9 

Gly Gin Arg Phe Asn His Arg Glu Ser His Asp Val Glu Thr Glu He 
15 10 15 



<210> 10 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide designed based on the partial amino acid 
sequence of endo-p-N-acetylglucosaminidase 

<400> 10 

tgrttraadc gytgdccytt 20 

<210> 11 

<211> 23 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide sequence of 5' terminal region of 
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endo-p-N-acetylglucosaminidase gene 
<400> 11 

atgccttcac ttcaattgca acc 

<210> 12 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide sequence of 3' terminal region of 
endo-p-N-acetylglucosaminidase gene 

<400> 12 

ctagtttaat gacaaatcta tgc 

<210> 13 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide designed based on the sequence of 
endo-p-N-acetylglucosaminidase gene 

<400> 13 
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cacttaagtc tatgaatgag 



<210> 14 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide designed based on the sequence of 
endo-p-N-acetylglucosaminidase gene 

<400> 14 

aatctctgga tttcgatagc 

<210> 15 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide designed based on the sequence of 
endo-p-N-acetylglucosaminidase gene 

<400> 15 

ggggcggccg cttttatttt acataaatat gccttcactt c 
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<210> 16 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide designed based on the sequence of 
endo-p-N-acetylglucosaminidase gene 

<400> 16 

cccgcggccg cctagtttaa tgacaaatct atgctacc 



<210> 17 
<211> 10 
<212> PRT 

<213> Mucor hiemalis 



<400> 17 

Pro Ser Leu Gin Leu Gin Pro Asp Asp Lys 
1 5 10 



<210> 18 
<211> 16 
<212> PRT 

<213> Mucor hiemalis 
<400> 18 
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Lys Ser Tyr 
1 



Arg Asn Pro Glu lie Tyr Pro Thr Asp Gin Asn lie Lys 
5 10 15 



<210> 19 
<211> 14 
<212> PRT 

<213> Mucor hiemalis 
<400> 19 

Lys Phe Asn Val Ser Ser Val Ala Leu Gin Pro Arg Val Lys 
1 5 10 



<210> 20 
<211> 11 
<212> PRT 

<213> Mucor hiemalis 
<220> 

<221> PEPTIDE 
<222> 2 

<223> Xaa represents Met or Ser 
<400> 20 

Lys Xaa Asp Arg Leu Phe Leu Cys Gly Gly Lys 
1 5 10 
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<210> 21 
<211> 17 
<212> PRT 

<213> Mucor hiemalis 
<220> 

<221> PEPTIDE 
<222> 2 

<223> Xaa represents Gly or Met 
<220> 

<221> PEPTIDE 
<222> 3 

<223> Xaa represents Gin or Ala 
<220> 

<221> PEPTIDE 
<222> 4 

<223> Xaa represents Arg or Leu 
<220> 

<221> PEPTIDE 
<222> 6 

<223> Xaa represents Asn or Pro 
<220> 

<221> PEPTIDE 
<222> 8 

<223> Xaa represents Arg or Leu 
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<220> 

<221> PEPTIDE 
<222> 9 

<223> Xaa represents Glu or Leu 
<220> 

<221> PEPTIDE 
<222> 10 

<223> Xaa represents Ser or Leu 
<220> 

<221> PEPTIDE 
<222> 11 

<223> Xaa represents His or Thr 
<400> 21 

Lys Xaa Xaa Xaa Phe Xaa His Xaa Xaa Xaa Xaa Asp Val Glu Thr Glu 

15 10 15 

He 



<210> 22 
<211> 16 
<212> PRT 

<213> Mucor hiemalis 
<400> 22 

Lys Glu Gly Tyr He Ser Ser Ser Gly Ser He Asp Leu Ser Leu Asn 
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1 



5 



10 



<210> 23 
<211> 9 
<212> PRT 

<213> Mucor hiemalis 
<400> 23 

Lys Asn lie Gin Gly Asn Asn Tyr Lys 
1 5 



<210> 24 
<211> 11 
<212> PRT 

<213> Mucor hiemalis 
<400> 24 

Lys Tyr Ser Asp Tyr Pro Pro Pro Pro Pro Lys 
1 5 10 



<210> 25 
<211> 8 
<212> PRT 

<213> Mucor hiemalis 
<400> 25 
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Lys Leu Ser Leu Asp Ala Ser Lys 
1 5 



<210> 26 
<211> 13 
<212> PRT 

<213> Mucor hiemalis 
<400> 26 

Lys Asn Thr Asp Gly He Phe Leu Asn Tyr Trp Trp Lys 
1 5 10 



<210> 27 
<211> 15 
<212> PRT 

<213> Mucor hiemalis 
<220> 

<221> PEPTIDE 
<222> 3 

<223> Carboxymethylcystein 
<400> 27 

Lys Gly Cys Ser Leu Arg Tyr He Tyr Arg Thr Leu Leu Met Lys 
15 10 15 
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<210> 28 
<211> 7 
<212> PRT 

<213> Mucor hiemalis 
<220> 

<221> PEPTIDE 
<222> 6 

<223> Carboxymethylcystein 
<400> 28 

Lys Leu Thr Val Ala Cys His 
1 5 



<210> 29 
<211> 15 
<212> PRT 

<213> Mucor hiemalis 
<400> 29 

Lys Pro Gin Leu Leu Leu Thr His Asp Met Ala Gly Gly Tyr Lys 
15 10 15 



<210> 30 
<211> 14 

<212> PRT 

<213> Mucor hiemalis 
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<400> 30 

Lys Ser Met Asn Glu Leu Arg Asp Trp Thr Pro Asp Glu Lys 
1 5 10 



<210> 31 
<211> 10 
<212> PRT 

<213> Mucor hiemalis 
<400> 31 

Lys Leu Ala Pro Val Ser Phe Ala Leu Lys 
1 5 10 



<210> 32 
<211> 23 
<212> PRT 

<213> Mucor hiemalis 
<400> 32 

Lys Gly Gin Arg Phe Asn His Arg Glu Ser His Asp Val Glu Thr Glu 
15 10 15 

lie Ser He Pro Leu Tyr Lys 
20 
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<210> 33 
<211> 22 
<212> PRT 

<213> Mucor hiemalis 
<220> 

<221> PEPTIDE 
<222> 8 

<223> Carboxymethylcystein 
<400> 33 

Lys He Thr Ser Ser Leu Asp Cys Asp His Gly Ala Phe Leu Gly Gly 

15 10 15 

Thr Ser Leu He He Lys 
20 



<210> 34 
<211> 19 
<212> PRT 

<213> Mucor hiemalis 
<400> 34 

Lys Asn Glu Leu Phe Phe Lys Asn Thr Asp Gly He Phe Leu Asn Tyr 

15 10 15 

Trp Trp Lys 

<210> 35 
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<211> 9 
<212> PRT 

<213> Mucor hiemalis 
<400> 35 

Lys He Val He Glu Ala Val Asn Lys 
1 5 



<210> 36 
<211> 11 
<212> PRT 

<213> Mucor hiemalis 
<400> 36 

Ser Ser Arg He He Gin Asp Leu Phe Trp Lys 
1 5 10 



<210> 37 
<211> 14 
<212> PRT 

<213> Mucor hiemalis 
<400> 37 

Lys Thr Asp Ser Ser Arg He He Gin Asp Leu Phe Trp Lys 
1 5 10 
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